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ENZYME and SNP MARKER FOR DISEASE 

The present invention relates to an isolated nucleic acid sequence which encodes a 
novel protein, where the protein is a new member in a family of enzymes with 
5 peptidase activity. Also provided is the use of the nucleic acid sequence and/or 
protein in medicine and research, a method for diagnosing, or determining 
predisposition to disease, methods for preventing or treating disease, and kits for use 
in the methods and the use of the nucleic acid sequence, protein and inhibitors thereof 
in treating or preventing inflammatory diseases, and in screens for identifying new 
10 inhibitors. Also provided are nucleic acid expression vectors,' host cells, screens and 
non-human transgenic animals. 

The diseases of asthma, eczema and hay fever are typified by Immunoglobulin E 
mediated reactions to common allergens. These diseases are known as "atopic". They 

15 are increasing in prevalence, and are now a major source of disability throughout the 
developed world. They are the result of complex interactions between genetic and 
environment mechanisms. The atopic state is characterised by prolonged exuberant 
Immunoglobulin E (IgE) responses to common inhaled proteins, known as allergens. 
Atopy is accompanied by elevation of the total serum IgE concentration, and by the 

20 presence of IgE specific to allergens. This specific IgE may be measured directly in 
the serum, by ELISA or other techniques. It may also be detected by prick skin tests, 
in which minute traces of allergen are introduced through a prick in the skin surface: 
atopic individuals respond to the test with a visible wheal on the skin surface. 

25 Asthma is a major cause of disease in children and young adults and is becoming more 
prevalent and is the most common disease of childhood and arises due to the 
interaction between strong environmental and genetic factors. Asthma may be 
identified by intermittent mucosal inflammation, wheezing, and shortness of breath. 
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Asthma is usually recognised epidemiological^ by standard symptom questionnaires 
or by physician diagnosis. Physician-diagnosed asthma ascertained by questionnaire 
has a heritability of 60-70%. Linkage to asthma and its associated phenotypes has been 
demonstrated near the IL1 cluster on chromosome 2. Genome-wide scans for linkage 
5 to atopy and asthma-associated phenotypes have identified the marker D2S160 on 
chromosome 2ql4 near the IL-1 complex which has been associated with "wheeze" 
(p-0.001). As the region contains IL1 and its homologues, it was investigated for 
association between asthma and known SNPs in IL1 cluster genes. Contrary to 
expectations there was no association found to these known polymorphisms in a 
10 combined panel of 246 families containing 1122 individuals and 381 asthmatics. 
However a replicated association between asthma and alleles of a D2S308 
microsatellite, approximately 1Mb distant from the IL1 genes, has been found. 

The genetic factors which predispose an individual to asthma are thought to be 
15 variants of DNA structure ("polymorphisms") that alter the level of expression or the 
function of genes. Variants of DNA sequence at a particular site ("locus") are known 
as "alleles". 

Disease causing alleles will be in linkage disequilibrium with non-functional 
20 polymorphisms from the same chromosomal segment. It is therefore possible to detect 
allelic association with disease from particular chromosomal segments, without 
identifying the exact polymorphism and gene underlying the disease state. 

The detection of allelic association may therefore give information as to disease 
25 susceptibility in a particular individual. Furthermore, allelic association is indicative 
of exons of a disease-causing gene are usually present within a limited distance (50- 
500 Kb) of DNA in either direction from the allele. 

Identification of genetic polymorphisms in linkage disequilibrium with asthma, or 
30 atopy will allow the identification of children at risk of such diseases before the 
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disease has developed (for example immediately after birth), with the potential for 
prevention of disease. The presence of particular polymorphisms or combinations of 
polymorphisms may predict the clinical course of disease (e.g. severe as opposed to 
mild) or the response to particular treatments. This diagnostic information will be of 
use to the health care, pharmaceutical and insurance industries. 

Oligopeptidases are endopeptidases that act only on smaller polypeptides or 
oligopeptides. These enzymes perform important, specialized biological functions that 
include the modification or destruction of peptide messenger molecules. 
Oligopeptidases have few naturally occurring inhibitors, and their distinctive 
specificity prevents them from interacting with a2-macroglobulin, unlike the great 
majority of endopeptidases. Members of the prolyl-oligopeptidase family S9 (enzyme 
nomenclature committee EC 3.4.14.5), subfamily S9B include DPP4, DPP6, DPP8 
and DPP9. The S9 family contains serine proteases with a varied range of relatively 
restricted substrate specificities. S? peptidases are either soluble, cytosolic proteins or 
integral type II membrane-bound proteins and do not appear to exist as pro-enzymes, 
and are synthesised in an active form. The active site triad, Ser, Asp, His has been 
identified in DPP4, DPP6, DPP8, and DPP9, although the putative catalytic serine 
residue is substituted with aspartate in DPP6 and glycine in Drosophila Melanogaster 
CG9059. In all known members of the family, these residues are within 130 residues 
of the carboxyl-terminus. All members of the prolyl-oligopeptidase family contain a 
conserved 7 amino-acid motif DW(V fUL)YEEE in the predicted 0-propeller domain. 
Two of the glutamate residues within this conserved motif have been shown to be 
essential for serine protease enzyme activity. The membrane-bound members of the 
S9 family contain membrane-spanning domains near the amino-terminus. Prolyl- 
oligopeptidases are responsible for cleaving the C-terminal peptide bond adjacent to a 
di-peptide sequence Ala-Pro or Gly-Pro. Substrates for these enzymes include 
chemokines, growth factors, and neuro- and vaso-active peptides. 
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Certain cell surface peptidases have been identified as specific antigens of immune 
cells. These include DPP4 (CD26), neutral endopeptidase (CD10), and 
aminopeptidase N (GD13). These cell surface antigens play an important role in the 
differentiation of immune cells. It has also been postulated that prolyl-oligopeptidases 
5 may play a role in the alteration of the Thl/Th2 (T-helper cell) balance during the 
inflammatory process. DPP4 (CD26) has been shown to have multifunctional 
properties in addition to its peptidase activity. 

Based on it's proposed structure DPP4 (CD26) is predicted to be a multifunctional 
10 protein and has been shown to interact with several proteins outside of it's catalytic 
domain. DPP4 (CD26) is an adhesion receptor for both collagen and fibronectin and 
has been shown to mediate the lung colanisation of breast cancer cells in a rat model, 
an effect mediated primarily through fibronectin binding. DPP4 (CD26) can also bind 
adenosine deaminase, an enzyme that metabolises adenosine to inosine. The location 
15 of the DPP4/ADA complex at the cell surface is an important mechanism which 
regulates the effective concentration of adenosine in the vicinity of it's receptor. 
Adenosine suppresses T-cell activation through interaction with the A2a G-protein 
coupled receptor. Adenosine therefore plays an important role in suppression caused 
by ADA deficiency. 

20 

DPP4 (CD26) also plays an important role in T-cell signaling through interaction with 
CD45, protein tyrosine phosphatase. While the counter-receptor for DPP4 (CD26) 
has not yet been identified, cross-linking of DPP4 (CD26) with antibody induces 
tyrosine phosphorylation of a number of molecules known to be important in T-cell 
25 activation including p56M, ¥59fyn and ZAP-70. 

Of the S9B prolyl oligopeptidases, DPP6 homologues have been cloned from Bos 
taurus, Rattus norvegicus and Mus musculus. DPP6 has been reported to exist as 2 
different isoforms (DPPX-S and DPP-L) as a result of alternative mRNA splicing and 
30 possible differential (tissue-specific) promoter usage. DPPX-S and DPPX-L 
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demonstrate differential tissue expression. Given that the two forms have different 
cytoplasmic domains, this may form the basis for different transmembrane signalling 
systems. DPP6 has no detectable catalytic activity, most likely due to the substitution 
of serine for aspartate in the catalytic triad. In vitro mutagenesis of aspartate to serine 
5 failed to restore catalytic activity. 

To date, various markers of inflammatory diseases such as asthma and atopy have 
been identified, and used to identify those people at risk of such disease. Notably, 
however, there has been no success in linking such markers with a gene whose activity 
0 may be at the root of inflammatory disease. The identification of this gene will enable 
the provision of valuable diagnostic, therapeutic and research tools. 

According to a first aspect of the invention there is provided an isolated nucleic acid 
sequence comprising a sequence as shown in Figure 4, or a sequence as shown in 

5 Figure 4 which excludes one or more of the exon sequences as set out in Figure 10 
and/or Figure 4 when one or more of the exon sequences are replaced with one or 
more alternate exon sequence or one or more liver clone sequence from Figure 9 or a 
sequence complementary or substantially homologous thereto, or a fragment thereof. 
The sequence of Figure 4 is the human DPP10 mRNA sequence. Figure 4 is compiled 

) of the exons as shown in Figure 10; Alternate exons are shown in Figure 9. 
Alternative exons are referred to by different letters, for example exons la, lb etc are 
alternate sequences for exon 1 (likewise for exons 2A, 2B etc). Upper or lower case 
designation of exons does not have any significance. Alternate transcripts for Figure 4 
can be seen schematically in Figure 6. In addition, there is provided an isolated 
nucleic acid sequence comprising the sequence shown in Figure 5a and/or a sequence 
as shown in Figure 5a which excludes one or more exons or a sequence as set out in 
Figure 5a when one or more of the exons are replaced with an alternate exon from 
Figure 8. Figure 7 shows a schematic overview of mouse transcripts, or a sequence 
complementary or substantially homologous thereto, or a fragment thereof. The 
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sequence of Figure 5a is the mouse DPP10 cDNA sequence. The mouse DPP10 
mRNA sequence is shown in Figure 5. 

The DPP10 nucleic acid sequence can comprise any combination of one or more 
5 exons from la, lb, lc, Id, le, If, lg, 2a, 2b, 2c, 2d, 2e, 2f, 2g, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or the sequences of liver clones 
1, 2 or 3 or a sequence substantially homologous thereto, or a fragement thereof. 
These combinations are for all DPP10 sequences including human and mouse - where 
the exons of the mouse do not include If or lg. 

10 

The following isolated nucleic acid sequences are part of the first aspect. 

There is provided an isolated nucleic acid sequence comprising one or more of the 
exons of DPP10, for example those exons shown in Figures 6, 7, 8, 9, 10 or 15 or the 
15 sequence set out in Figure 2c, a sequence complementary or substantially homologous 
thereto, or a fragment thereof. 

There is provided an isolated nucleic acid sequence comprising the exon sequence of 
Figure 2a, or a sequence which is complementary or substantially homologous thereto. 

20 

There is provided an isolated nucleic acid sequence comprising exons la and 2 to 25, 
or a sequence which is complementary or substantially homologous thereto, or a 
fragment thereof (Transcript 1), Figure 10. 

25 There is provided an isolated nucleic acid sequence comprising exons lb, lc and 2 to 
25, or a sequence which is complementary or substantially homologous thereto, or a 
fragment thereof (Transcript 2), Figures 9 and 10. 
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There is provided an isolated nucleic acid sequence comprising exons lb, lc, Id, le 
and 2 to 25, or a sequence which is complementary or substantially homologous 
thereto, or a fragment thereof (Transcript 3), Figures 9 and 10. 

5 There is provided an isolated nucleic acid sequence comprising exons If and 2 to 25, 
or a sequence which is complementary or substantially homologous thereto, or a 
fragment thereof (Transcript 4), Figures 9 and 10. 

There is provided an isolated nucleic acid sequence comprising exons lg and 2 to 25, 
10 or a sequence which is complementary or substantially homologous thereto, or a 
fragment thereof (Transcript 5), Figures 9 and 10. 

There is provided an isolated nucleic acid sequence comprising exons la and 2A, or a 
sequence which is complementary or substantially homologous thereto, or a fragment 
15 thereof (Transcript 6), Figures 9 and 10. 

There is provided an isolated nucleic acid sequence comprising exons lb, lc and 2B, 
or a sequence which is complementary or substantially homologous thereto, or a 
fragment thereof, Figures 9 and 15. 

20 

There is provided an isolated nucleic acid sequence comprising exons lb, lc and 2C, 
or a sequence which is complementary or substantially homologous thereto, or a 
fragment thereof, Figures 9 and 15. 

25 There is provided an isolated nucleic acid sequence comprising exons lb, lc and 2D, 
or a sequence which is complementary or substantially homologous thereto, or a 
fragment thereof, Figures 9 and 15. 
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There is provided an isolated nucleic acid sequence comprising exons lb, lc and 2E, 
or a sequence which is complementary or substantially homologous thereto, or a 
fragment thereof, Figures 9 and 15. 

5 There is provided an isolated nucleic acid sequence comprising exons lb, lc and 2F, 
or a sequence which is complementary or substantially homologous thereto, or a 
fragment thereof, Figures 9 and 15. 

There is provided an isolated nucleic acid sequence comprising one or more of the 
10 mouse exons of DPP10, for example those exons shown in Figure 5, 5a, 7 or 8, or a 
sequence complementary or substantially homologous thereto, or a fragment thereof. 

There is provided an isolated nucleic acid sequence comprising mouse exons la and 2 
to 25, or a sequence which is complementary or substantially homologous thereto, or a 
15 fragment thereof (Mouse Transcript 1), Figures 5a, 7 and 8. 

There is provided an isolated nucleic acid sequence comprising mouse exons lc, Id 
and 2 to 25 or a sequence which is complementary or substantially homologous 
thereto, or a fragment thereof (Mouse Transcript 2), Figures 7 and 8. 

20 

There is provided an isolated nucleic acid sequence comprising mouse exons le and 2 
to 25- or a sequence which is complementary or substantially homologous thereto, or a 
fragment thereof (Mouse Transcript 3), Figures 7 and 8. 

25 There is provided an isolated nucleic acid sequence comprising mouse exons lb, lc, 
Id and 2 to 25 or a sequence which is complementary or substantially homologous 
thereto, or a fragment thereof (Mouse Transcript 4), Figures 7 and 8. 

Promoter sequence for DPP10: A consensus palindromic JFN-gamma activation site 
30 (GAS) element was identified 18181base pairs upstream from Exon lb. The GAS site 
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is underlined. Represented in italics are other important promoter motifs: a CATT 
box and a TATA box. 

C CA7TCTCriTGTTTTrATTCGGGATGCTC TAAA 

5 

This motif is also present in the promoter of the CD26Mipeptidylpeptidase IV (DPP4) 
gene (a member of the prolyl oligopeptidase S9B subfamily). Interferons (DRNs alpha, 
beta and gamma) and trans retinoic acid (RA) have the ability to activate genes with 
GAS sites. 

10 

All the sequences of the present invention are isolated, or alternatively may be 
recombinant. By isolated is meant a nucleic acid or polypeptide sequence which has 
been purified, and is substantially free of other protein and nucleic acid. Such 
sequences may be obtained by PCR amplification, cloning techniques, or synthesis on 
15 a synthesiser. By recombinant is meant nucleic acid sequences which have been 
recombined by the hand of man. 

The polynucleotide sequences of the invention may. be genomic or cDNA, or RNA, 
preferably mRNA, or PNA, In the present invention, gene products include 
20 polynucleotide sequences and protein. References to polypeptide sequences include 
proteins and peptides. 

In the present application, sequences which are complementary or substantially 
homologous are those sequences which hybridise under stringent conditions to the 
25 defined sequence or its gene products. Thus, for example, a nucleic acid sequence 
substantially homologous to a reference nucleic acid will be capable of hybridising to 
a gene product (i.e. mRNA) of the reference nucleic acid, under stringent conditions. 
A complementary sequence is one which is capable of hybridising to the nucleic acid 
sequence itself, under stringent conditions. Also provided in the present invention are 
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complements of the substantially homologous sequences. A substantially homologous 
sequence preferably has at least 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 
89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 100% sequence identity with the defined 
sequence. This definition of substantially homologous applies to both nucleic acid and 
5 polypeptide sequences. Thus, polypeptide sequences having conservative amino acid 
substitutions that do not affect structure or function are also included. For any given 
DNA sequence, references to a complementary sequence include the corresponding 
mRNA sequence and any cDNA sequence derived on such an RNA sequence. 

10 "% identity" is a measure of the relationship between two nucleic acid or polypeptide 
sequences, as determined by comparing their sequences. In general, the two 
sequences to be compared are aligned to give a maximum correlation between the 
sequences. The alignment of the two sequences is examined and the number of 
positions giving an exact amino acid or nucleotide correspondence is determined, and 

15 divided by the total length of the alignment, and the result is multiplied by 100 to give 
a % identity. The % identity may be determined over the whole length of the 
sequence to be compared, which is particularly suitable for sequences of the same or 
similar lengths or for sequences which are highly homologous, or over shorter defined 
lengths which is more suitable for sequences of unequal lengths and with a lower 

20 homology. 

Methods for comparing the identity of two or more sequences are known in the art. 
For example, programs available in the Wisconsin Sequence Analysis Package version 
9.1 (Devereux J et al, Nucl Acid Res 12 387-395 (1984), available from Genetics 
25 Computer Group, Madison, Wisconsin, USA), such- as BESTFTT and GAP may be 
used. 



BESTFTT uses the "local homology" algorithm of Smith and Waterman (Advances in 
Applied Mathematics, 2:482-489, 1981) and finds the best single region of similarity 
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between two sequences. BESTFIT is more suited to comparing two polynucleotide or 
two polypeptide sequences which are dissimilar in length, the program assuming that 
the shorter sequence represents a portion of the longer. In comparison, GAP aligns 
two sequences finding a "maximum similarity" according to the algorithm of 
Neddleman and Wunsch (J. Mol. BioL 48:443-354, 1970). GAP is more suited to 
comparing sequences which are approximately the same length and an alignment is 
expected over the entire' length. Preferably, the parameters "Gap Weight" and "Length 
Weight" used in each program are 50 and 3 for polynucleotide sequences and 12 and 4 
for polypeptide sequences, respectively. Preferably, % identities and similarities are 
determined when the two sequences being compared are optimally aligned. 



Other programs for determining identity and/or similarity between sequences are also 
known in the art, for instance the BLAST family of programs (Altschul et al, J. Mol 
Biol, 215:403-410, (1990) and Altschul et al, Nuc Acids Res., 25:289-3402 (1997), 
available from the National Centerfor Biotechnology Information (NCB), Bethesda, 
Maryland, USA and accessible through the home page of the NCBI at 
www.ncbi.nlm.nih.gov) and FASTA (Pearson W.R. and Lipman D.J., Proc. Nat Acac. 
Set, USA, 85:2444-2448 (1988), available as part of the Wisconsin Sequence 
Analysis Package). Preferably, the BLOSUM62 amino acid substitution matrix 
(Henikoff S. and Henikoff J.G., Proc. Nat. Acad Set, USA, 89:10915-10919, (1992)) 
is used in polypeptide sequence comparisons including where nucleotide sequences 
are first translated into amino acid sequences before comparison. 

Preferably, the program BESTFTT is used to determine the % identity of a query 
polynucleotide or a polypeptide sequence with respect to a polynucleotide or a 
polypeptide sequence of the present invention, the query and the reference sequence 
being optimally aligned and the parameters of the program set at the default value. 
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In relation to the present invention, "stringent conditions" refers to the washing 
conditions used in a hybridisation protocol. In general, the washing conditions should 
be a combination of temperature and salt concentration so that the denaturation 
temperature is approximately 5 to 20°C below the calculated T m of the nucleic acid 
5 under study. The T m of a nucleic acid probe of 20 bases or less is calculated under 
standard conditions (1M NaCl) as [4°C x (G-fC) + 2°C x (A+T)], according to Wallace 
rules for short oligonucleotides. For longer DNA fragments, the nearest neighbor 
method, which combines solid thermodynamics and experimental data may be used, 
according to the principles set out in Breslauer et al, PNAS 83: 3746-3750 (1986). 
10 The optimum salt and temperature conditions for hybridisation may be readily 
determined in preliminary experiments in which DNA samples immobilised on filters 
are hybridised to the probe of interest and then washed under conditions of different 
stringencies. While the conditions for PCR may differ from the standard conditions, 
the T m may be used as a guide for the expected relative stability of the primers. For 
15 short primers of approximately 14 nucleotides, low annealing temperatures of around 
44°C to 50°C are used. The temperature may be higher depending upon the base 
composition of the primer sequence used. Suitably stringent conditions are those 
under which non-specific hybridisation (e.g. to non-DPPIO encoding sequences) are 
avoided. Suitable stringent conditions are 0.5xSSC/l%SDS/58°C/30mins for a 21mer 
20 oligonucleotide probe. 



The complementary sequences of the invention (which may also be referred to herein 
as "antisense") may be useful as probes or primers, or in the regulation of DPP10 
expression. Preferably, the primer sequences are capable of amplifying all or a portion 
of a DPP10 gene. Preferred primer sequences are disclosed in the Examples. Pairs of 
primers for amplification of all or part of the gene, or alleles, or variants thereof, form 
another aspect of the invention. Similarly, DPP10 probes will be useful in detecting 
the presence or expression levels of DPP10, or variant forms thereof, in a sample from 
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a subject. The probes may also be useful in analysing the expression pattern of DPP 10 
in a subject. 

In the present application, fragments are any contiguous 10 residue sequence, or 
greater, such as 20, 30, 40, or 50 residue sequence. Preferably, fragments of nucleic 
acid or polypeptide sequences share one or more functional characteristics with 
DPP10 or its gene, or are capable of modulating (i.e. inhibiting or enhancing) such a 
fimctional characteristic. The novelty of a fragment according to the present 
embodiment may be easily ascertained by comparing the nucleotide or polypeptide 
sequence of the fragment with sequences catalogued in databases such as Genebank at 
the priority date, or by using computer programs such as DNASIS (Hitachi 
Engineering Inc) or Word Search or FASTA of the Genetic Computer Group 
(Madison, USA). 

The fragments may be used in a variety of diagnostic, prognostic or therapeutic 
methods or may be useful as research tools for example in screening. Fragments of 
the sequences of the first aspect or their complements may be used as primer 
sequences as described above. 

In a second aspect of the invention, the isolated nucleic acid sequences of the 
invention may be .provided in the form of a vector to enable the in vitro or in vivo 
expression of DPP10. Vectors include plasmids, chromosomes, artificial chromosomes 
and viruses and may be expression vectors, which are capable of expressing nucleic 
acid sequences in vitro or in vivo, or transformation vectors which are capable of 
transferring the nucleic acid sequence from one environment to another. The nucleic 
acid molecules of the invention may be operably linked to one or more regulatory 
elements including a promoter. 
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The term regulatory elements includes response elements, consensus sites, methylation 
sites, locus control regions, post-transcriptional modifications, splice variants, 
homeoboxes, inducible factors, DNA binding domains, enhancer sequences, initiation 
codons, secretion signals and, polyA sequences. Regions upstream or downstream of a 
promoter such as enhancers, which regulate the activity of the promoter are also 
regulatory elements. 



The vector may also comprise an origin of replication; appropriate restriction sites to 
enable cloning of inserts adjacent to the polynucleotide molecule; markers, for 
10 example antibiotic resistance genes; ribosome binding sites: RNA splice sites and 
transcription termination regions; polymerisation sites; or any other element, such a 
secretion signals, which may facilitate the cloning and/or expression of the 
polynucleotide molecule. 

15 . Within a vector the gene may be expressed upstream or downstream of an expressed 
protein tag such as a histidine tag, V5 epitope tag, green fluorescent protein tag, MHC 
tag or other such tag known to those skilled in the art. Use of such a tag allows easy 
localisation, affinity purification and detection of the fusion protein with an antibody 
to the tag moiety; 

20 

Where two or more nucleic acid molecules of the invention are introduced into the 
same vector, each may be controlled by its own regulatory sequences, or all molecules 
may be controlled by the same regulatory sequence. In the same manner, each 
molecule may comprise a 3' polyadenylation site. Examples of suitable vectors will be 
25 known to persons skilled in the art and include pBluescript II, lambdaZap, and pCMV- 
Script (Stratagene Cloning Systems, La Jolla, USA). 
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Appropriate regulatory elements, in particular, promoters will usually depend upon the 
host cell into which the expression vector is to be inserted. Where microbial host cells 
are used, promoters such as lactose promoter system, tryptophan (Tip) promoter 
system, p-lactamase promoter system or phage lambda promoter system are suitable. 
Where yeast cells are used, preferred promoters include alcohol dehydrogenase I or 
glycolytic promoters. In mammalian host cells, preferred promoters are those derived 
from immunoglobulin genes, SV40, Adenovirus, Bovine Papilloma virus etc. Suitable 
promoters for use in various host cells would be readily apparent to a person skilled in 
the art (See, for example, Current Protocols in Molecular Biology Edited by Ausubel 
et al y published by Wiley). In addition, the regulatory elements may be modified, for 
example by the addition of further regulatory elements, to achieve a desired expression 
pattern. 



By operably linked is meant that the components of the vector or sequence are in a 
relationship which allows them to function as intended. 



These vectors may be used to transform host cells, for example, prokaryotic or 
eukaryotic cells. These cells may be used in the production of recombinant DPP10 
gene products, or in the regulation or analysis of DPP10. The transformed host cells 
form part of the invention. Preferred cells include Kcoli, yeast, filamentous fungi, 
insect cells, mammalian cells, preferably immortalised, such as mouse, CHO, HeLa, 
Myeloma or Jurkat cell lines, human and monkey cell lines and derivatives thereof. 

According to a third aspect of the invention, there is provided a polypeptide sequence 
comprising a polypeptide sequence encoded by a nucleic acid sequence of the first 
aspect of the invention. Preferably the polypeptide sequences are encoded by a 
nucleic acid sequence of Figures 2a, 4, 5, 5a, 6, 9 or 10. 
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The third aspect of the invention includes a polypeptide sequence comprising a 
polypeptide sequence as shown in any one of Figures 2a, 2c, 4, 5, 5a, 8, 9, 11, 12 or 21 
or a sequence homologous thereto, or a fragment thereof. The sequences of Figures 
2c, 4, 9, 11 and 12 are the predicted human DPP10 polypeptide sequences. The 
5 sequences of Figures 5, 5a and 8 are the predicted mouse DPP10 polypeptide 
sequences. Figure 21 shows both mouse and human polypeptide sequences. 

In a preferred embodiment of the third aspect there is provided a membrane bound 
form of DPP10 protein (Figure 6, transcript 4; Figure 4, Figure 11). In particular the 
10 membrane bound form of DPP10 includes amino acids 35 to 56 of Figure 11 which 
form the transmembrane domain (also Figure 16). The mouse equivalents are 
transcripts 1, 2 and 3 (Figure 7). 

In a preferred embodiment of the third aspect, there is provided a soluble foim of the 
15 DPP10 protein (Transcripts 2, 3, 5 and 6 of Figure 6), which lacks a transmembrane 
domain or the catalytic domain (Figure 14), or the beta-propeller domain (Figure 17). 
In particular, the soluble DPP10 protein lacks amino acids 35 to 56 of Figure 11, 
which form the transmembrane domain. In a most preferred embodiment, the soluble 
DPP10 protein comprises amino acids 57 to 751 or 796 of Figure 11. The mouse 
20 equivalent is transcript 4 (Figure 7). 

The soluble DPP10 protein may be operably linked to a secretion signal, to assist its 
secretion from the golgi apparatus to another part of the cell. Suitable secretion 
signals can be provided by recombinant vectors such as pSecTag2 (Invitrogen 
25 Corporation, Carlsbad, CA). Proteins expressed from such vectots are fused at the N- 
terminus to the murine Ig kappa chain leader sequence. The secretion signal may be 
linked to the soluble DPP10 polypeptide sequence using techniques available in the 
art, including recombinant DNA technology. 
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The DPP10 protein or a sequence substantially homologous thereto of a fragment 
thereof may be subject to post-translational modification. Post-translational 
modification (PTM) is defined herein as including modification of a protein following 
translation by proteolytic cleavage e.g. cleavage of a preprotein, a proprotein or a 
5 preproprotein by removal of a signal sequence or activation of a zymogen. PTM also 
includes the attachment of a carbohydrate to a protein, the predominant sugars 
attached include glucose, galactose, mannose, fucose, GalNAC, GlcNAC and NANA. 
The carbohydrates may be linked to the protein either by O-glycosidic or N-glycosidic 
bonds. Also included are acylation; methylation; phosphorylation; sulfation and 
10 prenylation. Vitamin C-dependent modifications such as proline and lysive 
hydroxylation and carboxy terminal amidation and vitamin K-dependent modifications 
such as carboxylation of glutamine residues are also included as is the addition of 
selenium as selenocysterine in a protein. 

15 The polypeptide sequences of the third aspect are preferably functional and may be 
useful in drug screening, diagnosis or therapy. Functional fragments of DPP10 are 
those which share immunological or functional characteristics with the full length, 
membrane bound or soluble form of DPP10. Fragments may be at least 10, preferably 
15, 20, 25, 30, 35, 40 or 50 amino acids in length. Preferably, the polypeptide 

20 sequences are isolated. 

In a fourth aspect of the present invention, there are provided antibodies which are 
specific for an antigen of a. polypeptide sequence of the third aspect or an antigen of 
the isolated nucleic acid of the first aspect, or fragment of either aspect or which react 
25 with an antigen of a polypeptide sequence of the third aspect or the isolated nucleic 
acid of the first aspect, or fragment of either aspect. Herein the term "react" has the 
meaning that the antibody is able to. interact with the polypeptide or isolated nucleic 
acid. The term "specific for" has the meaning that the antibody specifically reacts with 
the polypeptide or isolated nucleic acid. 



5 
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Antibodies can be made by the procedure set forth by standard procedures (Harlow 
and Lane, "Antibodies; A Laboratory manual" Cold Spring Harbour Laboratory, Cold 
Spring Harbour, New York, 1998). Briefly, purified antigen can be injected into an 
animal in an amount and in intervals sufficient to elicit an immune response. 
Antibodies can either be purified directly, or spleen cells can be obtained from the 
animal. The cells are then fused with an immortal cell line and screened for antibody 
secretion. The antibodies can be used to screen DNA clone libraries for cells secreting 
the antigen. Those positive clones can then be sequenced as described in, for example, 
Kelly et ah, Biotechnology 10:163-167 (1992) and Bebbington et al, 
Bio/Technology 10:169-175 (1992). Preferably, the antigen being detected and/or 
used to generate a particular antibody will include polypeptide sequences according to 
the third aspect or isolated nucleic acid sequences according to the first aspect. The 
antibody may be a polyclonal or monoclonal antibody, a chimeric antibody, a 
humanised antibody or a bifunctional antibody or a fragment of any of the above. A 
15 bifunctional antibody is an antibody that can bind to two different antigens, these 
antigens may be different antigens present in the DPP10 polypeptide or isolated 
nucleic acid or may be an antigen of DPP10 combined with e.g. a cellular antigen. 

In particular, the antibody may be raised against a particular domain of DPP10, such 
20 as the cytosolic soluble form; the p-propeller domain, or the external domain. Such 
antibodies will be useful in diagnostic and therapeutic aspects of the invention. In 
particular, the antibodies will be useful in the development of assays for detecting or 
measuring DPP10 in a sample from a subject. 



25 



In a preferred embodiment, the antibody may be used to assay the level of cytosolic, 
soluble DPP10 protein in a serum sample obtained from a subject (Figure 27). 



30 



According to a fifth aspect of the invention, there is provided a process for the 
preparation of a nucleic acid sequence as defined above, the process comprising 
ligating together successive nucleotide and/or oligonucleotide residues together. Such 
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a process may be carried out using chemical synthesis methods or by using enzymic 
catalysis. Alternatively, a suitable host cell may be transfected with an appropriate 
DNA or RNA sequence so as to cause production of the desired sequence in a host 
cell. 

5 

In a sixth aspect of the invention, there is provided a process for the preparation of a 
polypeptide as defined above, the process comprising ligating together successive 
- amino acids and/or oligonucleotides together. Such a process may be carried out using 
chemical synthesis methods or by using enzymic catalysis. Alternatively, a suitable 
10 host cell may be transfected with an appropriate DNA or RNA sequence so as to cause 
production of the desired sequence in a host cell. 

In the context of the present invention, references to DPP10 include the soluble, 
membrane bound or insoluble forms of the protein, which comprise at least the 
15 polypeptide sequence encoded by exohs 2 to 25. Thus, references to DPP10 include 
proteins encoded by transcripts having one or more of exons la, lb, lc, Id, le, If or 
lg. 

In a seventh aspect of the present invention, there is provided the following; a cell 
20 comprising a nucleic acid sequence according to an aforementioned aspect of the 
invention; or a transgenic non-human animal comprising a nucleie acid sequence 
according to an aforementioned aspect of the invention. Such cells (either alone, in, 
suspension, in culture or as part of a group of cells representing an organ) and 
transgenic non-human animals are useful for the analysis of a single nucleotide 
25 polymorphisms and their phenotypic effect and so for the analysis of DPP10 and its 
phenotypic effect. Expression of a polynucleotide sequence of the invention in a 
transgenic non-human animal is usually achieved by operably linking the 
polynucleotide to a promoter and/or enhancer sequence, preferably to produce a vector 
of the above aspect, and introducing this into an embryonic stem cell of a host animal 
30 by microinjection techniques (Hogan et aU A Laboratory Manual, Cold Spring 
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harbour and Capecchi Science (1989) 244: 1288-1292). The transgene construct 
should then undergo homologous recombination with the endogenous gene of the host. 
Those embryonic stem cells comprising the desired nucleic acid sequence may be 
selected, usually by monitoring expression of a marker gene, and used to generate a 
non-human transgenic animal. Preferred host animals include mice, rabbits and other 
rodents. 



The nucleic acid sequence introduced may not be native to the host animal, i.e. it may 
be foreign. Such transgenic animals may be distinguished from native, non-transgenic 
10 animals using methods known in the art, for example a nucleic acid sample from the 
transgenic animal may be compared with that from a native animal - the transgenic 
animal will have a nucleic acid sequence such as a foreign promoter, marker genes etc. 
Alternatively, the phenotypes of the animals can be compared. 

15 Where it is desirable to use the transgenic non-human animal of the seventh aspect to 
study disease, it may be desirable for the nucleic acid introduced into the animal to 
encode a variant of DPP10 which results in allergy, atopy, asthma or inflammatory 
disease including rheumatoid arthritis,, ankylosing spondylitis, inflammatory bowel 
disease, Crohn's disease, multiple sclerosis or type II diabetes. In such embodiments 

20 where the disease has been artificially introduced, the transgenic non-human animal 
will be modulated such that it no longer expresses the native DPP10 gene. These 
animals may be referred to as "knock-out" (Manipulating The Mouse Embryo- A 
Laboratory Manual, Hogan et al 1986). In some cases, it may be desirable to modulate 
the expression of the foreign nucleic acid and/or the native gene in a temporal or 

25 spatial manner. This approach removes viability problems if the expression of the 
native gene is abolished in all tissues. 
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' In a most preferred embodiment, there is provided a transgenic mouse comprising a 
nucleic acid encoding a variant form of DPP10 which causes allergy, atopy, asthma or 
inflammatory disease including rheumatoid arthritis, ankylosing spondylitis, 
inflammatory bowel disease, Crohn's disease, multiple sclerosis or type H diabetes. 
Most preferably, the nucleic acid molecule comprises a SNP at the position which 
corresponds to Position 259007 (where the base adenine is changed to cytosine), 
Position 267901 (where the base adenine is changed to guanine) and/or Position 
318524 (where the base thymine is changed to cytosine) of Figure 1. Preferably, the 
mouse is modulated so that it no longer expresses DPP10 in a temporally and/or 
spatially appropriate manner using homologous recombination techniques or 
alternatively to over express DPP10 protein as a result of transgenic manipulation. 

If a functional polymorphism is identified (i.e. a "mutation") a DPP10 construct 
containing this polymorphism can be introduced into the mouse , germ line (i.e. a 
knock-in) to produce a pathological variant of the DPP10 protein rather than knocking 
it out. Alternatively a pathological variant of the DPP10 gene may be overexpressed. 

In the context of the present invention, inflammatory diseases include those resulting 
from overexpression of DPP10, or the presence of a variant form of DPP10. 
Specifically, such diseases include allergies, and atopic diseases such as asthma, or 
inflammatory bowel disease e.g. ulcerative' colitis and Crohn's disease, and 
inflammatory joint disease such as rheumatoid arthritis and ankylosing spondylitis, or 
psoriasis, multiple sclerosis or type II diabetes. 



In an eighth aspect of the present invention, there is provided a method of diagnosing, 
or determining susceptibility of a subject to inflammatory disease. The method may 
comprise determining the presence of a variant form of DPP10, which is known to be 
associated with a disease state, or measuring the levels of DPP10. A variant form of 
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DPP10 includes nucleic acid and amino acid variants. A variant includes any SNP 
from the wild-type (e.g. for humans) (Figure 1) or other mutation or alteration from 
the wild-type. 

5 For example, probes or primers as described above may be useful in detecting nucleic 
acid encoding DPP10 or a variant thereof Information regarding the expression 
pattern or forms of DPP10 present will be useful in determining whether the individual 
is susceptible to inflammatory diseases, resulting from altered expression of DPP10; 
possibly by influencing the ratio of membrane bound to cytosolic forms of the DPP10 
10 protein. 

In a preferred embodiment, the method may additionally, or alternatively, comprise 
determining the presence or absence of a risk allele of one or more of the SNPs of 
Tables la, lb and lc or Table 3, where presence of a risk allele is indicative of disease 
15 or predisposition to disease.. The method may also comprise genotyping one or more 
known polymorphisms. Any combination of such polymorphisms may be genotyped. 

The SNPs of the invention are listed in Tables la, lb and lc or Table 3, where the 
nature of the polymorphism is described in the format wild type allele/variant allele. 
20 For example, the SNP at position 21818 (denoted 69WTC50W) has an adenosine 
residue in the wild type sequence, which is substituted for a guanine residue in the 
variant sequence. The SNPs are positioned with respect to Figure 1, where nucleotide 
position 1 is the 1 st nucleotide in the Figure L 

25 Preferably, the allele of the SNP polymorphisms are as follows: a nucleotide residue 
other than adenine at position 259007; a nucleotide residue other than adenine at 
position 267901; and a nucleotide residue other than thymine at position 318524 of 
Figure 1. More specifically, the risk allele are a cytosine residue at position 259007; a 
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guanine residue at position 267901; and a cytosine residue at position 318524 of 
Figure 1. 

The alleles for the remaining SNPs identified in the present invention are described in 
5 Tables la, lb and lc or Table 3. 

Any technique, including those known to persons skilled in the art, may be used in the 
above method. These may include the use of probes or primers as described above, or 
antibodies of the fourth aspect, for example in ELISA assays or in immuno- 
10 localisation. Preferably, the method comprises first removing a sample from a subject. 
More preferably, the method comprises isolating from a sample a nucleic acid or a 
polypeptide sequence. 

In particular, methods for use in this aspect include those known to persons skilled in 
15 the art for identifying differences between nucleic acid sequences, for example direct 
probing, allele, specific hybridisation, PCR methodology including Pyrosequencing 
(Ahmadian A, Gharizadeh B, Gustafsson AC, Sterky F, Nyren P, Uhlen M, Lundeberg 
J. Single-nucleotide polymorphism analysis by pyrosequencing, Anal Biochem. 2000 
Apr 10;280(1): 103-10; Nordstrom T, Ronaghi M, Forsberg L, de Faire U, 
20 Mofgenstem R, Nyren P. Direct analysis of single-nucleotide polymorphism on 
double-stranded DNA by pyrosequencing. Biotechnol Appl Biochem. 2000 Apr;31 
(Pt 2): 107-12) Allele Specific Amplification (ASA) (W093/22456), Allele Specific 
Hybridisation, single base extension (US patent No. 4,656,127), ARMS-PCR, Taqman 
™ (US 4683202; 4683195; and 4965188), oligo ligation assays, single-strand 
25 conformational analysis ((SSCP) Orita et al PNAS 86 2766-2770 (1989)), Genetic Bit 
Analysis (WO 92/15712) and RFLP direct sequencing, mass-spectrometry (MALDI- 
TOF) and DNA arrays. The appropriate restriction enzyme, will, of course, be 
dependent upon the polymorphism and restriction site, and will include those known 
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to persons skilled in the art. Analysis of the digested fragments may be performed 
using any method in the art, for example gel analysis, or southern blots. 

There is provided a method of diagnosing, or determining predisposition to disease, 
5 comprising determining the presence or absence of a risk allele of a SNP at position 
259007 of Figure 1, wherein presence of the risk allele is diagnostic of disease or 
predisposition of disease. 

In addition, there is provided a method for diagnosing, or determining predisposition 
10 to, disease comprising determining the presence or absence of risk alleles of a SNP at 
positions 267901 and/or 318524 of Figure 1, wherein presence of a risk allele is 
diagnostic of disease or predisposition to disease. 

The present invention is advantageous in that it facilitates the accurate diagnosis of 
15 disease, or the determination of predisposition to disease. Thus, by genotyping, an 
individual may be identified as having or being predisposed to disease. This helps to 
identify those individuals who are likely to respond positively to particular treatments 
or preventative measures. Thus, more effective therapies or preventative measures can 
be administered. 

20 

The diseases which are associated with the polymorphisms of the invention include 
inflammatory diseases, such as inflammatory bowel disease, e.g. ulcerative colitis and 
Crohn's disease, and inflammatory joint disease, such as rheumatoid arthritis and 
ankylosing spondylitis, or multiple sclerosis or type II diabetes. Predisposition to 
25 disease in the context of the present invention means that these individuals are at 
higher risk of developing the disease, or a more severe form of the disease, or a 
particular form of the disease. 
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In the context of the present invention, a risk allele is the allele of a polymorphism 
which is associated with disease or predisposition to disease. The risk allele may be 
the wild type or the variant allele, as defined below. 

5 The term "polymorphism" refers to the coexistence of multiple forms of a sequence. 
Thus, a polymorphic site is the location at which sequence divergence occurs. The 
different forms of the sequence which exist as a result of the presence of a 
polymorphism are referred to as "alleles". The region comprising a polymorphic site 
may be referred to as a polymorphic region. 

10 

Examples of the ways in which polymorphisms are manifested include restriction 
fragment length polymorphisms (Botstein et al Am J Hum Genet 32 314-331 (1980)), 
variable number of tandem repeats, hypervariable regions, minisatellites, di- or multi- 
nucleotide repeats, insertion elements and nucleotide or amino acid deletions, 
15 additions or substitutions. A polymorphic site may be as small as one base pair, which 
may alter a codon thus resulting in a change in the encoded amino acid sequence. 

Single nucleotide polymorphisms arise due to the substitution, deletion or insertion of 
a nucleotide residue at a polymorphic site. Such variations are referred to as SNPs. 

20 SNPs may occur in protein coding regions, in which case different polymorphic forms 
of the sequence may give rise to variant protein sequences. Other SNPs may occur in 
non-coding regions. In either case, SNPs may result in defective proteins or regulation 
of genes, thus resulting in disease. Other SNPs may have no phenotypic effects, but 
may show linkage to disease states, thus serving as markers for disease. SNPs 

25 typically occur more frequently throughout the genome than other forms of 
polymorphism discussed above, and there is therefore a greater probability of finding a 
SNP associated with a particular disease state. 
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Linkage disequilibrium is the co-inheritance of two alleles at greater frequencies than 
would be expected from the separate frequencies of each allele. Conversely, alleles 
are in linkage equilibrium if they occur together. The expected frequency of two 
alleles inherited together is the product of the frequency of each allele. 

5 . 

Where two or more polymorphisms are genotyped, the method preferably defines 
determining the presence or absence of a haplotype which is indicative of disease or 
predisposition to disease. A haplotype is defined herein as a collection of polymorphic 
sites in a particular sequence that are inherited in a group, i.e. are in linkage 

10 disequilibrium with each other. The identification of haplotypes in the diagnosis of 
disease helps to reduce the possibility of false positives. The haplotype may be any 
. particular combination of polymorphisms of Tables la, lb and lc or Table 3, 
optionally in combination with one or more known polymorphisms. A preferred 
haplotype is the combination of SNPs at positions 259007, 267901 and 318524 of 

15 Figure 1. 

The methods of the eighth aspect are preferably carried out on a sample removed from 
a subject. Any biological sample comprising cells containing nucleic acid, preferably 
that of Figure 1, is suitable for this purpose. Examples of suitable samples include 
20 whole blood, leukocytes, semen, saliva, tears,' buccal, skin or hair. For analysis of 
cDNA, mRNA or protein, the sample must come from a tissue in which the sequence 
of interest is expressed. Blood is a readily accessible sample. Thus, the method of the 
eighth aspect preferably includes the steps of obtaining a sample from a subject, and 
preparing nucleic acid from the sample. 

25 

The subject is preferably a mammal, and more preferably a human. The subject may 
be an infant, a child or an adult. Alternatively, the sample may be obtained from the 
subject prepartum e.g. by amniocentesis. 
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A subject's risk factor for disease may be determined with reference also to other 
known genetic factors, and/or clinical, physiological or dietary factors. 

The above described methods may require amplification of the DNA sample from the 
subject, and this can be done by techniques known in the art, such as PCR (see PCR 
Technology: Principles and Applications for DNA Amplification (ed. H. A. Erlich, 
Freeman Press, NY 1992; PCR Protocols; A Guide to methods and Applications (eds. 
Innis et al, Academic press, San Diego, CA 1990); Mattila et al., Nucleic Acids Res. 
19 4967 (1991); Eckert et al, PCR Metliods and Applications 117 (1991) and US 
Patent No 4, 683, 202. Other suitable amplification methods include ligase chain 
reaction (LCR) (Wu et al, Genomics 4 560 (1989); Landegran et al, Science 241 
1077 (1988)), transcription amplification (Kwoh et al, Proc Natl Acad Sci USA 86 
1173 (1989)), self sustained sequence replication (Guatelli et al, Proc Natl Acad Sci 
USA 87 1874 (1990)) and nucleic acid based sequence amplification (NASBA). The 
latter two methods both involve isothermal reactions based on isothermal transcription 
which produce both single stranded RNA and double stranded DNA as the 
amplification products, in a ratio of 30 or 100 to 1, respectively. 

Where it is desirable to analyse multiple samples simultaneously, it may be preferable 
to use arrays as described in W095/11995. The array may contain a number of 
probes, each designed to identify variants of DPP10 from a sample. 

Where a restriction enzyme is required, it can be selected according to the nature of 
the polymorphism and restriction site. Suitable enzymes will be known to persons 
skilled in the art. Analysis of the digested fragments may be performed using any 
method in the art, for example gel analysis, or southern blots. 
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Determination of an allele of a polymorphism using the above methods typically 
involves the use of anti-sense sequences i.e. sequences which are complementary to 
the nucleic acid sequences of interest, which may include part of the sequence of 
Figure 1. Such sequence are described in the third aspect of the invention. 

5 

Where it is desirable to identify the presence of multiple single nucleotide 
polymorphisms, or haplotypes, in a sample from a subject, it may be preferable to use 
arrays as described in W095/11995. The array may contain a number of probes, 6ach 
designed to identify one or more of the above single nucleotide polymorphisms of the 
10 invention. 

An antibody to DPP10 as previously described may be used in the method of the 
eighth aspect. The detection of binding of the antibody to the antigen in a sample may 
be assisted by methods known in the art, such as the use of a secondary antibody 

15 which binds to the first antibody, or a ligand. Immunoassays including 
immunofluorescence assays (IF A) and enzyme linked immunosorbent assays (ELIS A) 
and immunoblotting may be used to detect the presence of the antigen. For example, 
where ELISA is used, the method may comprise binding the antibody to a substrate, 
contacting the bound antibody with the sample containing the antigen, contacting the 

20 above with a second antibody bound to a detectable moiety (typically an enzyme such 
as horse radish peroxidase or alkaline phosphatase), contacting the above with a 
substrate for the enzyme, and finally observing the colour change which is indicative 
of the presence of the antigen in the sample. 

25 Any biological sample comprising cells containing nucleic acid or protein is suitable 
for this purpose. Examples of suitable samples include whole blood, semen, saliva, 
tears, buccal, skin or hair. For analysis of cDNA, mRNA or protein, the sample must 
come from a tissue in which DPP10 is expressed. Peripheral blood leukocytes are a 
readily accessible sample. 
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According to a ninth aspect of the invention, there is provided a method of preventing 
or treating disease in a subject, wherein the method comprises modulating the activity, 
expression, half life or post translational modification of DPP10 in the subject. 

5 Preferably, the method is carried out in a subject who has been diagnosed as suffering 
from, or is susceptible to allergies, atopic diseases such as asthma or inflammatory 
bowel disease e.g. ulcerative colitis and Crohn's disease, and inflammatory joint 
disease such as rheumatoid arthritis and ankylosing spondylitis, or psoriasis, multiple 
sclerosis or type II diabetes 

10 

Preferably, the method comprise determining the presence or absence of a risk allele 
of a SNP such as one which has an association with allergy, atopy, asthma or 
inflammatory disease e.g. at position 259007, 267901 and/or 318524 of Figure 1; and 
if the risk allele is present, administering treatment in order to prevent, delay or reduce 
15 the disease. 

Preferably, the step of comprising determining the presence or absence of a risk allele 
is carried out in accordance with the eighth aspect, and therefore also comprises 
determining the presence or absence of risk alleles of SNPs of Tables la, lb and lc or 
20 Table 3, or any combination thereof, for example as described above. 

The prevention or treatment of disease according to the ninth aspect may include the 
administration of any agent capable of modulating the effects of DPP10 or of the 
disease-causing allele. Preferably, the agent is one which is capable of ameliorating 
25 the deleterious effects of the risk allele. The methods include, but are not limited to, 
gene therapy techniques. Gene therapy techniques typically involve replacing the 
nucleic acid sequence comprising the risk allele, or otherwise down regulating the 
effects of the risk allele. The nucleic acid sequences of the second aspect, or 
sequences anti-sense thereto, will be useful in gene therapy. 



WO 02/086113 




•CT/GB02/01887 



30 



By modulating is meant inhibiting or increasing the activity of the enzyme. 
Preferably, the activity is inhibited. The activity of the enzyme includes any aspect of 
its production or function, including transcription and translation of nucleic acid 
sequences, and assembly of the protein, and downstream interactions with other 
5 factors. 

The DPP10 activity can be modulated in a number of ways. For example, the 
expression of the gene may be inhibited through the use of antisense sequences, such 
as those of the first aspect of the invention or by the production of anti sense RNA 
10 sequences. Such sequences when introduced into a subject by gene therapy will 
hybridise to the DPP10 gene or RNA, and inhibit its transcription or translation. This 
method may be particularly useful where it is desirable to modulate the function or 
expression of certain splice variants of DPP10 whilst not affecting others. 

15 Introduction of a nucleic acid sequence may use gene therapy methods including those 
known in the art. In general, a nucleic acid sequence will be introduced into the target 
cells of a subject, usually in the form of a vector and preferably in the form of a 
phannaceutically acceptable carrier. Any suitable delivery vehicle may be used, 
including viral vectors, such as retroviral vector systems which can package a 

20 recombinant genome. The retrovirus could then be used to infect and deliver the 
polynucleotide to the target cells. Other delivery techniques are also widely available, 
including the use of adenoviral vectors, adeno-associated vectors, lentiviral vectors, 
pseudotyped retroviral vectors and pox or vaccinia virus vectors. Liposomes may also 
be used, including commercially available liposome preparations such as Dpofectin 

25 ®, Lipofectamine ®, (GDBCO-BRL, Inc. Gaitherburg, MD), Superfect ® (Qiagen Inc, 
Hilden, Germany) and Transfectam ® (Promega Biotec Inc, Madison WI). 

Other means to modulate a biological activity of DPP10 include agents which may 
affect interaction of DPP10 with downstream factors with which it interacts. For 
30 example, the activity of DPP10 may be affected by inhibiting its interaction with 
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molecules containing the XPXS motif such as chemokines and cytokines, e.g. those 
shown in Figure 19. In particular, the activity of DPP10 may be inhibited by the use 
of competitive or non-competitive inhibitors of any one or more of these chemokines 
or cytokines, or by small molecule inhibitors, which may function by inhibiting the 
5 active triad of DPP10. Other methods of inhibition will be known to persons skilled in 
the art. 

Also provided is an agent, for use in the prevention or treatment of inflammatory 
disease in a subject, as defined above. Agents include nucleic acid sequences of the 
10 first or second aspects, polypeptide sequences of the third aspect, antibodies of the 
fourth aspect, and any other agent defined herein, preferably those which are capable 
of modulating the activity of DPP10. 

The subject may be any animal, preferably a mammal, and more preferably human. 

15 

Also provided is the use of an agent as defined above in the manufacture of a 
medicament for use in the prevention or treatment of inflammatory disease, as defined 
above, in a subject. 

20 In a tenth aspect of the invention there is provided isolated nucleic acid molecules 
comprising part of the sequence of Figure 1, and comprising one or more SNPs at 
positions which correspond to the positions of Figure 1 listed in any one or more 
Tables la, lb, lcor3. 

15 Particular isolated nucleic acid molecules include those: 



comprising a SNP at the position corresponding to position 259007 of Figure 1 where 
the adenine is changed to cytosine. 
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comprising a SNP at the position corresponding to position 267901 of Figure 1 where 
the adenine is changed to guanine. 

comprising a SNP at the position corresponding to position 318524 of Figure 1 where 
5 the thymine is changed to cytosine. 

The nucleic acid molecules of the invention may be DNA, RNA, and single or double 
stranded sequences. All the molecules of the present invention are isolated, or 
alternatively may be recombinant. By isolated is meant a nucleic acid molecule which 
10 has been purified, and is substantially free of protein and other nucleic acid. Such 
molecules may be obtained by PCR amplification, cloning techniques, or synthesis on 
a synthesiser. By recombinant is meant nucleic acid molecules which have been 
recombined by the hand of man. 

15 The isolated nucleic acid molecules of the present invention are different to the "wild 
type" or "reference" sequence of Figure 1. The sequence of Figure 1, a 465Kb region 
surrounding the D2S308 marker, described in WO99/50451 as being associated with 
asthma, is derived from a BAC/PAC contig which is not part of the invention. The 
BACs in the contig are 416L5, 543L9 and 317L18. The PAC is 69CE4 is not part of 

20 the invention. The nucleic acid sequences of the invention which differ from the 
sequence of Figure 1 at any one or more of the positions detailed in Tables la, lb, lc 
or 3 are referred to as polymorphic variants of the sequence of Figure 1 and form part 
of the invention. 

25 This aspect of the invention also provides antisense sequences. Such sequences are 
typically single stranded and are capable of hybridizing to the above mentioned 
nucleic acid sequences of the invention, or to the sequence of Figure 1, under stringent 
conditions. Preferred antisense sequences are those which are capable of hybridising 
to an allele of a polymorphism of the invention, and most preferably is capable of 
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distinguishing between alleles of a polymorphism (of Tables la, lb, lc or 3). 
Stringent conditions are defined below. The antisense sequences may be prepared 
synthetically or by nick translation, and are preferably isolated or recombinant. 

5 The antisense sequences include primers and probes, for example for use in the 
methods of the present invention. Primer sequences are capable of acting as an 
initiation site for template directed nucleic acid synthesis, under appropriate conditions 
which will be known to skilled persons. Probes are useful in the detection, 
identification and isolation of particular nucleic acid sequences. Probes and primers 
10 are preferably 15 to 30 nucleotides in length. 

For amplification purposes, pairs and primers are provided. These include a 5' primer 
which hybridizes to the 5' end of the nucleic acid sequence to be amplified, and a 3' 
primer which hybridizes to the complementary strand of the 3' end of the nucleic acid 
15 to be amplified. Preferred primers are those listed in Table 2. . 

Probes and primers may be labelled, for example to enable their detection. Suitable 
labels include for example, a radiolabel, enzyme label, fluoro-label, biotin-avidin label 
for subsequent visualization in, for example, a southern blot procedure. A labelled 
20 probe or primer may be reacted with a sample DNA or RNA, and the areas of the 
DNA or RNA which carry complimentary sequences will hybridise to the probe, and 
become labelled themselves. The labelled areas may be visualized, for example by 
autoradiography. 

25 Preferably, the probes and/or primers hybridise under, "stringent conditions", which 
refers to the washing conditions used in a hybridisation protocol. The hybridisation 
conditions for probes are preferably sufficiently stringent to allow distinction between 
different alleles of a polymorphism upon binding of the probes. In general, the 
washing conditions should be combination of temperature and salt concentration so 
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that the denaturation temperature is approximately 5 to 20°C below the calculated T m 
of the nucleic acid under study. The T m of a nucleic acid probe of 20 bases or less is 
calculated under standard conditions (1M NaCl) as [4°C x (G+C) + 2°C x (A+T)], 
according to Wallace rules for short oligonucleotides. For longer DNA fragments, the 
5 nearest neighbor method, which combines solid thermodynamics and experimental 
data may be used, according to the principles set out in Breslauer et aL, PNAS 83: 
3746-3750 (1986). The optimum salt and temperature conditions for hybridisation 
may be readily determined in preliminary experiments in which DNA samples 
immobilised on filters are hybridised to the probe of interest and then washed under 

10 conditions of different stringencies. While the conditions for PCR may differ from the 
standard conditions, theT m may be used as a guide for the expected relative stability of 
the primers. For short primers of approximately 14 nucleotides, low annealing 
temperatures of around 44°C to 50°C are used. The temperature may be higher 
depending upon the base composition of the primer sequence used. Typically, the salt 

15 concentration is no more than 1M, and the temperature is at least 25°C. Suitable 
conditions are 5XSSPE (750 mM NaCI, 50mM NaPhosphate, 5mM EDTA pH 7.4) 
and a temperature of 25^30°C, 

In an eleventh aspect, there is provided a host cell comprising a vector or isolated 
20 nucleic acid molecule according to the aforementioned aspects. The host cell may 
comprise an expression vector, or naked DNA encoding the nucleic acid molecules of 
the invention. A wide variety of suitable host cells are available, both eukaryotic and 
prokaryotic. Examples include bacteria such as Exoli, yeast, filamentous fungi, insect 
cells, mammalian cells, preferably immortalised, such as mouse, CHO, HeLa, 
25 myeloma or Jurkat cell lines, human and monkey cell lines and derivatives thereof. 
The host cells are preferably capable of expression of the nucleic acid sequence to 
produce a gene product (i.e. RNA or protein). Such host cells are useful in drug 
screening systems to identify agents for use in diagnosis or treatment of individuals 
having, or being susceptible to inflammatory disease as defined above. 
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The method by which said nucleic acid molecules are introduced into a host cell will 
usually depend upon the nature of both the vector/DNA and the target cell, and will 
include those known to a person skilled in the art. Suitable known methods include 
but are not limited to fusion, conjugation, liposomes, immunoliposomes, lipofectin, 
5 transfection, transduction, eletroporation or injection, as described in Sambrook et al 

In a twelfth aspect of the invention there is provided a kit for diagnosis of disease or 
predisposition to disease, comprising a means for determining the presence or absence 
of a risk allele of a SNP of Tables la, lb, lc or 3, wherein the risk allele is diagnostic 
10 of disease or of predisposition to disease. 

In a preferred embodiment, the kit comprises a means for determining the presence or 
absence of one or more risk alleles of polymorphisms according to the eighth aspect. 
In particular, the kit comprises means for determining the presence or absence of a risk 
15 allele of a SNP at position 259007, position 267901, and/or position 318524 of Figure 
1. 

Preferably the kit will comprise the components necessary to determine the presence 
or absence of a risk allele, in accordance with the eighth aspect of the invention. Such 

20 components include PCR primers and/or probes, for example those described above, 
PCR enzymes, restriction enzymes, and DNA or RNA purification means. Preferably, 
the kit will contain at least one pair of primers, or probes, preferably as described 
above in accordance with the tenth aspect of the invention. The primers are preferably 
allele specific primers. Other components include labeling means, buffers for the 

25 reactions. In addition, a control nucleic acid sample may be included, which 
comprises a wild type or variant nucleic acid sequence as defined above, or a PCR 
product of the same. The kit will usually also comprise instructions for carrying out 
the diagnostic method, and a key detailing the correlation between the results and the 
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likelihood of disease. The kit may also comprise an agent for the prevention or 
treatment of disease. 

In a thirteenth aspect of the invention, there is provided a method of identifying a 
5 compound for treatment of disease, comprising (a) administration of a compound to 
tissue comprising an isolated nucleic acid molecule comprising a SNP at a position 
which corresponds to a position of Figure 1 listed in Tables la, lb, lc or 3; and (b) 
determining whether the agent modulates effects of the SNP. 

10 In a preferred embodiment, the isolated nucleic acid molecule is according to the tenth 
aspect of the invention, and most preferably comprises a SNP at a position 
corresponding to position 259007, position 267901, and/or position 318524 of Figure 
1. 

15 In this aspect, a nucleic acid molecule of the invention, and/or a cell line according to 
an aforementioned aspect, may be used to screen for agents which are capable of 
modulating the effect of a SNP. 

Potential agents are those which react differently with a risk allele and non-risk allele. 

20 Putative agents will include those known to persons skilled in the art, and include 
chemical or biological compounds, sense or anti-sense nucleic acid sequence for 
example as described above, binding proteins, kinases, and any other gene or gene 
product agonist or antagonist. Preferably, the agent will be capable of modulating the 
effects of the disease causing allele. Most preferably, the agent is one which is 

25 capable of ameliorating the deleterious effects of the risk allele. 
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Such agents may be suitable for either prophylactic administration or after a disease 
has been diagnosed. The route of administration is suitably chosen according to the 
disease or condition to be treated, however, typical routes of administration of the 
agent of the present invention include but are not limited to oral, rectal, intravenous, 
5 parenteral, intramuscular and sub-cutaneous routes. The invention also provides for * 
agents to be administered either as DNA or RNA and thus as a form of gene therapy. 
The agents may be delivered into cells directly by means including but not limited to 
liposomes, viral vectors and coated particles (gene gun). 

10 In a fourteenth aspect of the present invention there is provided an agent or antibody 
as described above according to the invention, or use in preventing or treating 
allergies, and atopic diseases such as asthma or inflammatory bowel disease e.g. 
ulcerative colitis and Crohn's disease, and inflammatory joint disease such as 
rheumatoid arthritis and ankylosing spondylitis, or psoriasis, multiple sclerosis or type 

15 II diabetes. 

There is also provided the use of an agent or antibody as described above in the 
manufacture of a medicament for use in the prevention or treatment of allergies, and 
atopic diseases such as asthma or inflammatory bowel disease e.g. ulcerative colitis 
20 and Crohn's disease, and inflammatory joint disease such as rheumatoid arthritis and 
ankylosing spondylitis, or psoriasis, multiple sclerosis or type II diabetes. 

According to a fifteenth aspect of the invention, there is provided, a pharmaceutical 
composition comprising a nucleic acid or polypeptide sequence as defined above 
25 according to the invention. Alternatively, the pharmaceutical composition may 
comprise an agent as defined in relation to the above aspect or an antibody according 
to the fourth aspect of the invention. 
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Administration of pharmaceutical compositions is accomplished by any effective 
route, e.g. orally or parenteraUy. Methods of parental delivery include topical, intra- 
arterial, subcutaneous, intramedullary, intravenous, or intranasal administration. 
Administration can also be effected by amniocentesis-related techniques. Oral 
administration followed by subcutaneous injection would be the preferred routes of 
uptake; also long acting immobilisations would be used. In addition to the active 
ingredients, these pharmaceutical compositions may contain suitable pharmaceutically 
acceptable carriers comprising excipients and other compounds that facilitate 
processing of the active compounds into preparations which can be used 
pharmaceutical^. Further details on techniques for formulation and administration 
may be found in the latest edition of "REMINGTON'S PHARMACEUTICAL 
SCIENCES" (Maack Publishing Co, Easton PA). 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutically acceptable carriers well known in the art, in dosages suitable for oral 
administration. Such carriers enable the pharmaceutical compositions to be 
formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, 
suspensions, etc., suitable for ingestion by the patient. 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to 
achieve the intended purpose. Thus, a therapeutically effective amount is an amount 
sufficient to ameliorate or eradicate the symptoms of the disease being treated. The 
amount actually administered will be dependent upon the individual to which 
treatment is to be applied, and will preferably be an optimised amount such that the 
desired effect is achieved without significant side-effects. The determination of a 
therapeutically effective dose is well within the capability of those skilled in the art. 
Of course, the skilled person will realise that divided and partial doses are also within 
the scope of the invention. 
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For any compound, the therapeutically effective dose can be estimated initially either 
in cell culture assays or in any appropriate animal model. These assays should take 
into account receptor activity as well as downstream processing activity. The animal 
model is also used to achieve a desirable concentration range and route of 
administration. Such information can then be used to determine useful doses and 
routes for administration in humans. 

A therapeutically effective amount refers to that amount of agent, which ameliorates 
the symptoms or condition. Therapeutic efficacy and toxicity of such compounds can 
be determined by standard pharmaceutical procedures, in cell cultures or experimental 
animals (e.g. ED50, the dose therapeutically effective in 50% of the population; and 
LD 5 o, the dose lethal to 50% of the population). The dose ratio between therapeutic 
and toxic effects is the therapeutic index, and it can be expressed as the ratio 
ED50/LD50. Pharmaceutical compositions, which exhibit large therapeutic indices, are 
preferred. The data obtained from cell culture assays and animal studies is used in 
formulating a range of dosage for human use. The dosage of such compounds lies 
preferably within a range of circulating concentrations that include the ED 50 with little 
or no toxicity. The dosage varies within this range depending upon the dosage form 
employed, sensitivity of the patient, and the route of administration. 

The exact dosage is chosen by the individual physician in view of the patient to be 
treated. Dosage and administration are adjusted to provide sufficient levels of the 
active moiety or to maintain the desired effect. Additional factors, which may be 
taken into account, include the severity of the disease state. Long acting 
pharmaceutical compositions might be administered every 3 to 4 days, every week, or 
once every two weeks depending on half-life and clearance rate of the particular 
formulation. Guidance as to particular dosages and methods of delivery is provided in 
the literature (see, US Patent No's 4,657,760; 5,206,344 and 5,225,212 herein 
incorporated by reference). 
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According to a sixteenth aspect of the invention, there is provided a number of 
screens. A first screen provides for identifying an agent which modulates DPP10 
activity comprising: 

5 providing a polypeptide sequence as claimed in any one of claims 8 to 13; 
providing a DPP10 substrate; . 
providing an agent to be tested; 

measuring whether the agent to be tested modulates DPP10 by measuring processing 
of theDPPIO substrate. 

10 

The components of the screen are combined, in any optional order. 

In the screening assay the DPP10 may be any DPP 10 as claimed in any one of claims 
8 to 13. Fragments of the DPP10 molecule such as the P-propeller domain, the 
15 fibronectin binding domain or another extracellular matrix binding domains may be 
used. Also, DPP10 polypeptides which comprise one or more SNP nucleic acid 
sequences of the present invention, such as described in any one of claims 37 to 41 
may be used. The DPP10 polypeptide may be purified or non-purified. The DPP10 
polypeptide may be soluble. It may comprise one or more of the domains. 

20 

The agent being tested is .being identified for use in the prevention or treatment of a 
DPP10 related or mediated disease or disorder. Such diseases or disorders include: 
asthma, eczema or hayfever (known as atopic diseases), inflammatory bowel disease 
(e.g ulcerative colitis or Crohn's disease), inflammatory joint diseases (such as 
25 rheumatoid arthritis and ankylosing spondylitis), psoriasis, brain diseases involving an 
inflammatory component such as multiple sclerosis or type n diabetes or hypertension. 
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The DPP10 substrate may be any which is processed by a DPP10 polypeptide of the 
invention. By processed is meant any changes which can be measured. The substrate 
may comprise the formula XPXS or more specifically the formula NH 2 X-P(X) y S-(X) D 
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where NH 2 is the amino terminus, X is any amino acid, P is proline, S is serine, y is 1 
to 4, and n is any number. The substrate may be a cytokine containing the XPXS 
motif such as those in Figure 19, namely RANTES, SDF-1, EOTAXJN, IP10, MCP2, 
IL17 P, IL2, GCP-2, DL18bp, chemokine CC-l/CC-3, interieukin-8, CTAK/ALP/LIC, 
5 or a small peptide having such a motif. These substrates may be fluorescently labelled 
or modified to allow easy detection of processing. Such labelling or modification is 
known to the person skilled in the art. 

Typically the processing of the substrate will comprise measuring protease activity, for 
10 example such protease activity may be detected by cleavage of the substrate. 
Modulation is taken to mean either an increase or decrease in activity of the enzyme. 
Such activity may be affected by an alteration in expression of DPP10 or a change in 
the DPPlO's half life or a change in the post-translational modification of the DPP10. 

15 The present invention further provides a screen for identifying an agent which 
modulates DPP10 activity comprising: 

providing a DPP10 polypeptide as claimed in any one of claims 8 to 13; 
providing an agent to be tested; 
20 providing a cell; and 

measuring whether the agent to be tested modulates DPP10 by measuring adhesion of 
the cell to a surface 

Such a screen can be referred to as a cell adhesion screen (or assay). The components 
25 of the screen arc combined, in any optional order. 

Typically cells used in the cell adhesion assay may be maintained in suspension where 
adhesion is measured by aggregation of the cells due to intercellular adhesion 
' molecule interactions. Alternatively, adhesion to a surface may be measured. The 
30 surface may be a non-biological molecule e.g. tissue culture plastic or it may be a 
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biological molecule which is cellular or non-cellular. Examples of a non-cellular 
molecule include extracellular matrix components such as fibronectin, collagen and 
such like. One or more cells or other biological non-cellular molecules may be 
attached to a surface such as a tissue culture surface or an extracellular matrix 
component coated surface. Adhesion is determined by measuring the adhesion of a 
cell to a surface. Modulation in cell adhesion may be either an increase in cell 
adhesion or a decrease in cell adhesion. An agent is considered to be a modulator of 
DPP10 activity if it affects DPP10 activity, this may be either at the level of 
expression of the DPP10 molecule or by altering the half life of the DPP10 molecule 
or by affecting the post-translation modification status of the DPP10 molecule. 

Yet a further aspect of the invention provides a screen for identifying an agent which 
modulates DPP10 activity comprising: 



15 providing a DPP10 polypeptide as claimed in any one of claims 8 to 13; 
providing an agent to be tested; 
providing a cell; 

measuring a change in differentiation or proliferation of the cell. 

20 The components of the screen are combined, in any optional order. 

Typically, differentiation may be measured by any means known to the persons skilled 
in the art for example in the case of a T-Iymphocyte, the change in differentiation can 
be T-cell activation. In the case of other cell types it may be the induction or 
25 prevention of production of a secretable cell signalling factor such as an 

immunomodulator e.g. a cytokine or growth factor, The immunomodulator may be a 
peptide or may be any other biological substance which expression is altered by an 
agent which modulates DPP10. Typically this assay is performed in vitro for example 
in tissue or organ culture. 
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The change in phenotype may be any. It may involve a change in T-cell and/or B-cell 
phenotype. 

Such a screen provides an in vitro model for identifying an agent which modulates 
5 DPP10 activity. 

Yet a further aspect of the invention provides a screen for identifying an agent which 
modulates DPP10 activity comprising: 

10 providing a transgenic animal according to one of claims 23 to 25 or 59; 
providing an agent to be tested; 

contacting the transgenic animal with the agent to be tested; 
detecting a change in the transgenic animals phenotype. 

15 The components of the screen are combined, in any optional order. 

The cell against which the agent is tested may be in suspension, tissue culture, as part 
of an organ or as part of an animal. Preferably the animal is a laboratory animal, such 
• as a rat, rabbit, mouse or other rodent. 

20 

Yet a further aspect of the invention provides a screen for detecting a side effect 
associated with the use of an agent which modulates DPP10 comprising: 

providing a cell which does not substantially express DPP10; 
25 providing an agent to be tested; 

contacting the agent to be tested with the cell; and 
measuring any side effect produced by the agent on the cell. 



30 



The components of the screen are combined, in any optional order. 
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• The side effect to be measured may be any, and may depend on whether the cell is part 
of a larger tissue or animal. It may involve a change in cell differentiation, or cell 
proliferation. The side effect may be a measure of the change of phenotype of an 
organ or animal. 

5 

Yet a further aspect of the invention provides a screen for identifying an agent which 
modulates DPP10 activity comprising: 

providing a DPP10 nucleic acid according to any of claims 1 to 4; 
10 providing an agent to be tested; 

measuring whether the agent to be tested modulates DPP10 by measuring the 
interaction of the agent with the sample of nucleic acid. 

Preferably this screen is an in vitro transcription assay, measuring transcription of 
15 DPP10. 

Alternatively, an agent may be identified by the use of theoretical or model 
characteristics of DPP10. The functional or structural characteristics of DPP10 may 
be of the protein itself or of a computer generated model, a physical two- or three- 
20 dimensional model or an electrical (e.g. computer) generated primary secondary or 
tertiary structure, as well as the pharmacaphore (three dimensional electron density 
map) or its X ray crystal structure. 

Putative agents will include those known to persons skilled in the art or new 
25 substances, and include chemical or biological compounds, such as anti-sense 
nucleotide sequences, polyclonal or monoclonal antibodies which bind to polypeptide 
sequence of the second aspect. 
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According to a seventeenth aspect of the invention, there is provided the use of a 
nucleic acid sequence or polypeptide sequence as defined above in a screen for an 
agent which modulates the activity of DPP10. 

.The method preferably comprises contacting a putative agent with a nucleic acid or 
polypeptide sequence according to an aforementioned aspect of the present invention 
and monitoring expression and/or activity of the nucleotide or polypeptide sequence. 
Potential agents are those which alter the activity or expression of the DPP10 
nucleotide or polypeptide sequence compared to the activity or expression in the 
absence of the agent. The present method may be carried out by contacting a putative 
agent with a host cell, tissue culture, or transgenic non-human animal comprising a 
nucleotide or polypeptide according to the invention, and displaying inflammatory 
disease. 

Also provided are agents identified by the method of the sixteenth or seventeenth 
aspects. 

Preferred features for the second and subsequent aspects of the invention are as for the 
first aspect mutatis mutandis. 

The invention will now be described with reference to the following drawings and 
examples which are included for the purposes of illustration only and are not to be 
construed as being limiting on the invention. 

FIGURE 1 shows the DNA sequence from the chromosone 2 Asthma Locus. 
FIGURE 2a shows the MEX4 predicted exon sequence. 
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FIGURE 2b shows the location of MEX4 within the refined region of linkage 
disequilibrium and relative to marker D2S308. 
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FIGURE 2c shows the full length insert sequence and translation of the foetal 
brain cDNA clone identified by screening with MEX4. The MEX4 sequence 
is in bold and the start codon is underlined. 

5 

FIGURE 2d shows a full length insert sequence of the 3* RACE clone 
amplified from primers in MEX4 FB-1. 

FIGURE 3 is a schematic of the overlap between the MEX4FB-1 clone and 
10 KIAA1492 and between KIAA1492 and AK025075. The full length cDNA 

sequence is comprised of 25 exons. Exons 1 to 12 (partial) are encoded within 
the MEX4FB-1 clone. Exons 3 (partial) to 25 are encoded within the 
KIAA1492 clone. However, the first MET residue available in the KIAA1492 
predicted protein occurs at nucleotide 927, resulting in a predicted ORF of only 
15 1629bp. Addition of the first two exons from MEX4FB-1 clone provides the 

true MET residue and a predicted ORF of 2391bp from the combined clones. 

FIGURE 4 shows the full length sequence of the human DPP10 mRNA 
generated from an overlap of the MEX4FB-1 and 3' RACE clones. The 
20 MEX4 sequence is in bold and the initiation and termination codons are 

underlined. 

FIGURE 5 shows the full length Mouse DPP10 coding sequence. 

25 FIGURE 5a shows the mouse DPP10 full length cDNA sequence. 

FIGURE 6 shows an overview of human DPP10 alternative transcripts 1 to 6, 
the predicted transmembrane domains, exon 2A (Transcript 6), and the BAC 
location by accession number. 
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FIGURE 7 shows a schematic overview of the Mouse DPP10 transcripts. 

FIGURE 8 shows the Mouse DPP10 alternative exons, the predicted peptide 
sequence of transcripts 1 to 4 and the full length sequence of the transcript 1 . 

5 

FIGURE 9 shows the sequences of the human alternate exons lb to lg and 
alternate exon 2A, liver clones 1 to 3, exons 2B to 2G, the predicted peptides 
from Transcripts 1 to 6, the predicted peptides for the exon lb 3 'RACE clone 
transcripts 1 to 5, the human transcript 6 (MEX4-6 "stopper") sequence and the 
10 human 3 'RACE clone sequence. 

FIGURE 10 shows the sequences of the DPP10 exons. The 5'UTR is included 
in exon la and the 3'UTR sequence in exon 25. The coding regions of these 
two exons are in bold. 



FIGURE 1 1 shows the DPP10 predicted protein sequence of 796 amino acids. 



FIGURE 12 shows the amino acid sequence of DPP10. Amino acids 34-54 
(underlined) are predicted to traverse the membrane. Repeat sequences within 
P-propellers are shown in bold italics. Residues homologous to catalytic 
residues are underlined and shaded. 

FIGURE 13 shows Northern blots of DPP10 which demonstrates the presence 
of multiple transcripts. 

FIGURE 14 shows multiple alignment of the catalytic domains of DPP10 
homologues. Asterisks (*) mark the catalytic site positions. 

FIGURE 15 shows a schematic of exon lb 3'RACE transcripts. 
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FIGURE 16 shows a multiple alignment of the transmembrane region of 
DPP10 and homologues. 

FIGURE 17 shows a multiple alignment of the p-propellor domain of DPP10 
5 and prolyl oligopeptidase homologues. 

FIGURE 18 shows the proposed structure of porcine prolyl oligopeptidase 
(Fulop V, Bocskei Z, Polgar L. (1998) Prolyl oligopeptidase: an unusual beta- 
propeller domain regulates proteolysis. Cell 1998 Jul 24;94(2): 161-70). 

10 

FIGURE 19 shows the cytokines containing PxS motifs. Human chemokines 
and cytokines that have a serine within 10 amino acids of a predicted signal 
peptide cleavage site and which contain a PxS motif (where X represents any 
amino acid). The predicted signal peptide cleavage bond is shown as " A " and 
15 PxS motifs are highlighted. The first group of molecules contain the PxS two 

residues after the cleavage site. 

FIGURE 20 shows RT-PCR linking MEX4 with downstream DPP10 exons in 
adult brain cDNA. The expected 519bp product is observed in lane 1 with 
20 MEX4.F1 and MEX4.R1 and the expected 919bp product is observed in lane 4 

with MEX4.F1 and MEX4.R2. The marker (lane M) used is lOObp ladder. 
Negative controls (genomic DNA and water (H 2 0) were included. 

FIGURE 21 shows the sequence alignment of the mouse and human DPP10 
25 transcript 1 peptides. 

FIGURE 22a shows linkage disequilibrium within the asthma locus. 
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FIGURE 22b shows the location of DPP10 exons within the LD map. The 
disposition of the initial exons of DPP10 are shown relative to the LD map. 
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Significant association to the LnlgE and asthma is indicated by arrows above 
the exons. The scale bar indicates a distance of 50Kb. 

FIGURE 23 shows CLUSTAL X multiple sequence alignment of DPPX-L 
5 (DPP6): Genbank Accession P42658 (DPP6); DPPX-S: Genbank Accession 

P42658; DPP4: Genbank Accession AAA53208. 

FIGURE 24 shows theDPPIO alternative lg fetal liver transcripts. 

10 FIGURE 25 shows the BAC/PAC contig at chromosome 2ql4 showing refined 

region of linkage disequilibrium and relative location of marker D2S308. 

FIGURE 26 shows an example of pyrosequencing for SNP genotyping. 

15 FIGURE 27 shows a western blot of diluted human serum samples probed with 

anti-DPPIO c-terminus antibody. 

TABLE la shows SNPs identified in the sequence of Figure 1 (LD region). 

20 TABLE lb shows SNP's genotyped in the sequenced region of Figure 1 (LD 

region). 

TABLE lc shows DPP10 gene SNP outside of the LD region. 

25 TABLE 2 shows PCR primer sequences and positions in the sequence of 

Figure 1. 

TABLE 3 shows associations between asthma and the LnlgE and SNPs. 
30 TABLE 4a shows primer pair sequences used in RT-PCR. 
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TABLE 4b shows RT-PCR expression data. 

TABLE 5 shows regions of sequence conserved between human and mouse. 
5 Co-ordinates are given with reference to Figure 1 . 

TABLE 6 shows PSQ assay oligonucleotides and PCR annealing temperatures. 

TABLE 7 shows standard deviation in DPP10 expression levels ascertained by 
10 Taqman analysis of blood RNA from asthmatics and controls. 

EXPERIMENTAL EXAMPLES 

15 Overview of the experiments performed: 

Novel SNPs were identified by re-sequencing sections of the ~462kb contig around 
marker D2S308 in DNAs from asthmatic and control individuals. These SNPs were 
gentoyped across a panel of asthmatic families from three populations. Association 

20 analysis was performed by transmission disequilibrium tests (TDT), and identified 
several SNPs showing positive association with asthma. The genotype data from all 
the SNPs was used to refine the extent of linkage disequilibrium (LD) around the 
microsatellite marker D2S308 and the associated SNPs. A 113, 792bp region of 
sequence, containing the associated marker and SNPs and termed the Island of LD, 

25 was selected for transcript identification. 

In silico analysis of public sequence databases showed that no known genes map 
within the island of LD. The only expressed sequence tags (ESTs) which map into the 
region are: AA424226, H10825 and AA42637. Therefore extensive DNA sequence 
30 analysis was undertaken to identify coding DNA sequences within the 113792bp 
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island of LD. Using exon prediction algorithms, a putative coding sequence of 60bp 
(designated MEX4) was identified. Screening of a cDNA library with a DNA probe 
containing MEX4 identified a 1301bp cDNA clone (MEX4FB-1). BLASTN analysis 
against the Genbank nucleotide (nr) database identified 100% nucleotide homologies 
5 to 2 sequences - AB040925 and AK025075. Using this information it was possible to 
. assemble a composite full length cDNA clone with a predicted open reading frame of 
2391bp. Subsequent 3' RACE experiments from the end of the MEX4FB-1 clone, 
confirmed this predicted sequence as the C-terminal part of the gene and 3' UTR. 
Further 5* RACE experiments were performed from exon 3 and identified six other 

10 alternate N-terminal exons arranged in five transcript types. Further 3' RACE from 
Mex4/exon la, identified one additional C-terminal exon (2a) which formed part of a 
short cDNA with Mex4/exon la. Genomic structure analysis showed that the gene 
contains twenty five exons and spans a large region of genomic DNA of ~ 1MB. 
Based upon this structure the marker D2S308 falls within intron .1. The sequence of 

15 the clone AB040925, encodes part of exon 3 to exon 25, however because intron 1 is 
several hundred kb in length, no in silico prediction programmes predicted that this 
clone was spliced downstream of exon la. 

RT-PCR and Northern blot analysis was performed to analyse DPP10 expression, 
20 which was revealed to strongly express in a neuroendocrine fashion with evidence of 
multiple transcripts. DPP10 was also shown to be expressed in PBL and expression 
level in asthmatics and controls were assessed by Taqman analysis of total blood 
RNA. 



25 A transmembrane domain is present in the peptides predicted from two of the five 
transcripts. Subsequent cellular localization studies with epitope tagged DPP10 
constructs for transcript 1 and 2, transfected into Hela cells, have confirmed the 
membrane and cytosolic localizations predicted for these two transcripts. 
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BLASTX analysis against the Swissprot protein database detected significant 
homologies with a class of proteins known as dipeptidyl peptidases (prolyl- 
oligopeptidases), the most significant homology being with DPP6. The gene at 
Chromosome 2ql4 represents a novel dipeptidyl peptidase, DPP10. 

5 

EXAMPLE 1 

SNP discovery and association testing 
Subjects and Phenotyping 

The subjects were administered a modified British MRC respiratory questionnaire. 
10 "Asthma" was defined as a positive answer to the questions "Have you ever had an 
attack of asthma?" and "If yes, has this happened on more than one occasion?" 

"Wheeze" was defined as a positive answer to the question "Has your chest ever 
sounded wheezing or whistling?" and "If yes, has this happened on more than one 
occasion?" The total serum IgE was measured in all children. Skin tests to house dust 
15 mite and grass pollen was carried out. 

Three panels of subjects have been studied. . 

Panel A consisted of 80 nuclear families sub-selected from an Australian population 
sample of 230 families. The panel contained a total of 203 offspring forming 172 sib- 
20 pairs. 12% of the children were asthmatic. 



Panel B consisted of 77 nuclear and extended families recruited from asthma and 
allergy clinics in the United Kingdom. These families contained 215 offspring (268 
sib-pairs) of which 56% were asthmatic. 
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PandC consisted of 87 nuclear families recruited through a child attending an asthma 
clinic in the Oxford region. The families contained 216 offspring (148 sibling pairs), 
of whom 44% where asthmatic, 

5 Positional cloning and SNP discovery. 

We built an extended BAC/PAC contig covering 1.5Mb of the locus and sequenced 
approximately 465kb from four contiguous clones surrounding D2S308. SNP 
detection was systematically carried out on regions of DNA that were free of repeats 
by sequencing 5 unrelated subjects and 5 controls with and without asthma and a pool 
10 of DNA from 32 unrelated individuals. 

100 SNIPs were identified with minor allele frequencies > 20%, and 67 of these were 
genotyped on our subjects. SNP typing was by PCR and restriction digestion. In the 
absence of a natural restriction sequence, a primer was modified to generate a site. 

15 (Primers pairs are given in Table 2). Error checking and haplotype generation was 
carried out by the MERLIN computer program (Abecasis 2001). Linkage 
disequilibrium (LD) between markers was assessed by estimation of D' ftom the 
parental haplotypes (Abecasis et al., Am J Hum Genet 59 323-36 (1995)) and 
portrayed by the GOLD program Abecasis et al., Bioinfarmatics 16 182-3 (2000)). 

M) LD was distributed into four distinct islands (A, B, Bii and C) (Figure 22a). The 
border between the A and B island was flanked by the 543WTC91P and 
543WTC122P SNPs (Tables la, lb and lc). 

Association testing 

'5 Association was sought between Asthma and the SNPs by transmission disequilibrium 
tests (Spielman et al Am J Hum Genet 59 983-9 (1996)). (Table 3) (Figure 1). Positive 
associations were confined to the B island of LD (Figure 22b). The strongest 
association was observed with the 543WTC122P SNP, approximately 1Kb proximal 
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to D2S308: weaker associations were observed more distally, between 543WTC110P 
and 317WTC59P (9Kb and 59Kb away) (Table 3). 



Association was also sought between the Loge (IgE concentration) (LnlgE) and the 
5 SNPs by variance components analyses (Abecasis et al Am J Hum Genet 66 279-292 
(2001)) (Table 3). Moderate evidence for association was detected in the A island of 
LD, over a region of approximately 60Kb. The complete separation of LD between 
the A and B islands suggested that this QTL was different to the polymorphism 
affecting asthma status from the B island. The alleles of D2S308 are described in 
10 WO99/50451. 

SNP DP1007, (Table lc), located in the 3' UTR of the DPP10 gene showed positive 
association to Rasti (p=0.002), Psti (p=0.0073) and loge (p=0.0263). 

15 EXAMPLE 2 

Determination of the full length sequence of a novel dipeptidvlpeptidase-like gene, 
DPP10. 

20 Gene identification 

In order to identify genes within the associated sequence region, sequence similarity 
searches were performed using BLAST analysis against the public sequence databases 
(Althschul et al., 1997). No significant sequence identities or similarities to reported 
genes were identified. The only evidence for any coding sequence in the region were 

25 the DNA sequence matches to two ESTs; H10825 (IMAGE clone 46982) and 
AA426377 (IMAGE clone 757495). 
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Identification of conserved exons 

The next stage of analysis was to identify potential coding regions in the 113,792bp of 
DNA sequence by the use of exon prediction software. The sequence from the refined 
region of ID was subjected to exon prediction analysis by submission to the NIX site 
5 at Human Genome Mapping Project (HGMP) Resource Centre 
Oittp://www.hgmp.mrc.ac. ulc/homepap e.htrnn. This site incorporates the following 
exon prediction programmes; Grail/exon, Grail/gap2, MZEF, GENSCAN, Genemark, 
hmm.gene, FGENE, FEX, Genefinder and FGENES. For all DNA sequence analysis, 
default settings were used. 

10 

Over 100 exons were predicted in the region, 66 by more than one programme. The 
group of multiple-predicted exons included a 60bp exon designated as MEX4. The 
sequence of this predicted exon sequence is detailed in Figure 2a. MEX4 was 
identified by the FGENE/FEX software, with significance scores of 0.65 and 0.7 
15 respectively. The location of MEX4 within the refined region of linkage 
disequilibrium and relative to DNA marker D2S308 is shown in Figure 2b. 

PCR primers were designed to putative exon sequences predicted by more than one 
algorithm. PCR products generated from these oligonucleotide primer pairs were used 
20 in low stringency hybridisations against "zoo strips" - Southern blots containing 
restriction enzyme-digested DNA from human and mouse genomic DNA. MEX4 
showed weak hybridisation to the mouse DNA lanes (data not shown) and therefore 
was considered to harbor sequences conserved between the two species, indicative of 
functional, coding, DNA sequences. 

25 

Library screening 

A probe containing the MEX4 sequence was used to screen a commercial foetal brain 
cDNA library prepared in the lambda, -triplex phage vector (Clontech). 
Approximately 1 million phage clones were plated out onto twenty 15cm Luria agar 
30 plates, and duplicate colony lifts performed using 132mm circular nylon transfer 
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membranes (Hybond N+, Amersham). All procedures were performed according to 
the library manufacturer's published protocols 

(http.V/www.clontech^com/libraries/^echinfo . protocol PT3003-1, version PR09529). 

5 The MEX4 probe was labelled with a 32 P-dCTP using a random labelling kit ("Prime- 
it RmT random primer labeling kit, Stratagene) and purified through a G50 Spin 
column (Quick Spin™ columns, Boehringer Mannheim) according to the 
manufacturer's instructions. The library filter set was hybridised using a 
phosphate/SDS buffer (0.5M NaP0 4 , 7% SDS, ImM EDTA pH 8.0) at 65°C for 16 
10 hours. Filters were washed twice with 2L of 2XSSC/0.01%SDS for 30 minutes at 
65°C and once with 1XSSC/0.01%SDS for 30 minutes at 65°C, then exposed to 
autoradiographic film (Kodak X-OMAT) for two days with signal intensifying screens 
(Hyperscreen, Amersham). 

15 Three duplicated positive plaques were identified on the filters. Each pool of positive 
phage plaques was "cored" using a cut-off 1ml pipette tip. The plug of agar and 
plaques were put into a 2ml screw cap microcentrifuge tube containing 1 ml of lambda 
phage dilution buffer (lOOmM NaCl, lOmM MgCl 2 , 35mM Tris-HCl pH 7.5, 0.01% 
gelatin). Phage dilutions of 1:100, 1:1000 and 1:10,000 were prepared and plated out 

20 onto a series of secondary plates. Colony lifts were performed as above and the filters 
hybridised with MEX4 probe as above. 

At this stage one positive clone was chosen to carry forward for further investigation. 
This was named MEX4FB-1. Single hybridising plaques were available for this clone. 
15 One plaque was cored using a cut-off 1ml pipette tip and diluted with 350jil of lambda 
phage dilution buffer. A 10ml Luria broth culture of BM25.8 cells was grown O/N at 
31°C and inoculated with 150^,1 of the recovered phage plaque to perform plasmid 
extraction according to the library manufacturer instructions. 20jil of final plasmid 
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broth was plated out onto a Luria agar plate containing 50ug/ul of ampicillin and 
incubated O/N at 37°C. 

Plasmid DN A preparation 

Two colonies of MEX4FB-1 were picked from the plate and grown shaking at 37°C 
O/N in 10 mis of luria broth supplemented with 50ug/ul of ampicillin. Plasmid DNA 
was isolated using the QIAprep Spin Miniprep Kit (QIAGEN), according to the 
manufacturer's instructions. DNA yields were quantified for sequencing using a DNA 
fluorometer (HoefferLtd). 

Sequencing 

Plasmids were sequenced according to the protocol of the dynamic ET terminator 
cycle sequencing kit (Amersham). Initially forward and reverse vector sequencing 
primers were used, cDNAlibseq.R[dTCCGAGATCTGGACGAGC]and 
cDNAhbseq.R [dTAATACGACTCACTATAGGG]. The DNA sequence generated 
from two plasmid end reads did not overlap, so sequencing was also performed with 
two internal walking primers; 

MEX4-6 FB 21.FW [dTTTGTGCTTCACGATCCAGAGG] ; and 

MEX4-6 FB 21.RW [dGATGTCAGTCGCAATGAACTGC] 

to obtain full lenjth insert sequence. This l,301bp sequence is presented in Figure 2c. 

Identical sequence was obtained from both of the two colonies of MEX4FB-1 picked 

in the plasmid preparation stage. 

Each DNA sequencing reaction used: 400ng template DNA, 0.25 pmol primer, 8ul ET 
Terminator mix and distilled H 2 0 to a final reaction volume of 20ul. The cycle 
conditions used were: 96°C for 30 sec, 50°C for 20 sec, 60°C for 1 min for 25 cycles, 
with a final holding cycle at 4°C prior to DNA purification. Sequencing products 
were purified by gel filtration (plO gel)' followed by ethanol precipitation. Dry DNA 
pellets were resuspended in 2 ul deionised formamide and denatured at 96°C for 2.5 
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irdn, followed by snap-cooling on ice. IjjlI of each reaction was loaded onto a 48cm 
sequencing gel and run for 10 hours on an ABI 377 Sequencer and the data collected 
using filter set A. Sequence traces were analysed using the SEQUENCHER™ 
sequence analysis software, version 4.0.5b5 (Gene Codes Inc.). 

5 

Sequence analysis by BLASTN to determine overlapping clones 
The MEX4FB-1 clone sequence was used in a BLASTN analysis against the Genbank 
nucleotide NR database to identify any sequence matches (Altschul et al., 1997)^ A 
100% nucleotide match was identified from MEX4FB-1 nucleotides 409 to 1301 to 

10 the 5' end of a partial cDNA clone, KIAA1492 (Genbank: AB040925) (Nagase et al., 
2000). This clone extends 3,349bp 3' to the end of MEX4FB-L The full length 
sequence of AB040925 was retrieved via the National Center for Biotechnology 
Information (NCBI) website (http://www.ncbi.nlm.nih.gov/) using the Entrez 
nucleotide search facility. This full length AB040925 sequence was also subjected to 

15 BLASTN analysis against the NR database. This identified a second cDNA clone, 
AK025075, 2,225bp in length with a 99% match with AB040925 over2,210bp. The 
only mismatches were at the 3' end of the clone where a poly- A tail is present in 
AK025075, but not in AB040925. Therefore, it would appear that there may be two 
different 3' ends to the gene as the two cDNAs AB040925 and AK025075 are of 

20 different lengths. A schematic overview of these alignments is presented in Figure 3. 

The first available methionine residue in the AB040925 sequence is 509bp from the 5' 
end of the clone in the +2 reading frame. The predicted open reading frame (ORF) 
from this position is l,629bp in size. Upstream of this methionine are 169 coding 
25 residues, representing a full ORF. This implies that the reported AB040925 sequence 
is not full length. By combining the MEX4FB-1 clone with the overlapping 
AB040925 sequence a start methionine was identified at nucleotide position 165. 
Upstream of this methionine is a single coding residue, preceded by a STOP codon. 
The addition of the MEX4FB-1 sequence therefore provides the correct 5' end and 
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start codon for this gene and results in a predicted ORF of 2,39 Ibp, encoding 796 
amino acid residues. The combined sequence from these three clones, utilising the 
shorter 3' end represented by the sequence of clone AK025075, is presented in Figure 
4, 

5 - 

Sequence analysis by BLASTN to determine genomic structure 
The composite DNA sequence (Figure 4) from the combined cDNA clones was used 
in a BLASTN analysis against the HTGS database (Altschul et al., 1997). This 
resulted in the identification of multiple B AC clones containing portions of the gene 

10 sequence. In total 24 exons were identified within the cDNA sequence (Figures 9 and 
10). By comparison of the relative positions of these BAC clones on chromosome 2, it 
is apparent that the DPP10 gene spans a large region of genomic DNA. 
(http://genome.ucsc.edu/goldenPath/septTracks.html and 
http://www.ncbi.nlm.mh.gov/cgi-bi 

15 Most of the BACs detailed in Tables la to lc are in working draft, unfinished status, 
so current estimates could vary, but it appears that the gene spans approximately 
1.2Mb of DNA at chromosome 2ql4. 

Expression Analysis: 
'20 The primers: 

MEX4.F1 [dAACCA AACTGCC AGCGTGTCC] ; 
MEX4.R1 [dAAGACGGAGTCCTCTACTTCTGG] ; and 
MEX4.R2 [dATGGACCAACTCACACTTTGGAGC] 

were designed to, perform RT-PCR on cDNA from adult brain. This experiment was 
25 performed to confirm that the MEX4 exon is linked to the DPP10 exon sequences 
found in the clones AB040925 and AK025075, in a second cDNA source in addition 
to the isolated foetal brain clone. The primer MEX4.F1 is located at the beginning of 
the MEX4 exon, (DPP10 exon 1), the primer MEX4.R1 is located in DPP10 exon 6 
and the primer MEX4.R2 is located in exon 10. A 519bp product is expected if 
30 MEX4.F1 is used in conjunction with MEX4.R1 and a 919bp product when MEX4.F1 
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is used with MEX4.R2. The PCR conditions used were; ~50ng of cDNA (Marathon 
ready RACE cDNA, Clontech), in a 50ul reaction with lOpmoles of each primer, 
2.5fAM dNTPs 1.5mM Magnesium Chloride, and 0.5 units of Amplitaq Gold in IX 
amplitaq gold buffer II, (Perkin Elmer Biosystems). Cychng conditions were 95° C 
5 for a single 12 minute cycle, then 38 cycles of a 94° C denature for 15secs, 60°C 
anneal temp for 15secs, and 72° C for 30secs, followed by 72° C for 5 minutes. A 
50ng aliquot of genomic DNA was used as a negative control as well as a no-DNA 
control. A 5ul aliquot of each reaction was run out on 1% agarose gel prepared with 
1XTBE using standard methods. The results are shown in Figure 20. A clear band is 
10 seen in the brain lane for each reaction indicating that the MEX4 exon is expressed, 
linked to DPP10 downstream exons in adult brain as well as foetal brain cDNA. 

The 519bp RT-PCR product was gel purified using the QIAquick Gel Extraction Kit, 
(QIAGEN) according to the manufacturers instructions and used as a probe in 

15 Northern blot analysis. A single 50ng labeled probe was hybridised to three purchased 
Northern blots (MTNI, MTNH and MTNUI, Clontech) which represent mRNA from 
23 different tissues. Labeling with oc^P-dCTP was as reported for the cDNA library 
screening. Hybridisation was at 65°C in EXPRESSHYBE™ buffer (Clontech). 
Washing was twice with 500ml of 2XSSC/0.01%SDS for 30 minutes at 50°C and once 

20 with 500mls 1XSSC/0.01%SDS for 30 minutes at 50°C. The filters were then exposed 
to autoradiographic film for two days with signal intensifying screens. 

The results are shown in Figure 13. A doublet band comprised of two separate 
hybridising bands of between 3.6 - 4.1kb was clearly observed in pancreas, testes, 
25 spinal cord and adrenal gland following a 48-hour exposure. Much weaker expression 
of these products was observed in placenta, liver and small intestine. This doublet 
band was also seen in brain, in addition to two additional larger transcripts of 5.2kb 
and 7.5kb. Overall, the strongest expression was observed in brain, pancreas and 
adrenal gland. 
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Additional RT-PCR was performed on cDNA from tissue and cell lines. The RNA 
from different cell lines and tissues was extracted with RNeasy Mini Kit (Qiagen, 
#74104) except human PBL for which PAXgene Blood RNA Kit (Qiagen, #762132) 
was used. cDNA was prepared from the RNA using OMNISCRIPT Reverse 
5 Transcriptase Kit (Qiagen, #205111) followed by PGR with HotStar Taq PCR Kit. 
PCR was performed for 38 cycles (1 min at 95C, 1 min at 54C and 1 min at 72C per 
cycle). A number of different PCR primer pairs were used. These amplified between 
exon la to 7 (transcript 1 specific), exons lb to 7 (transcript 2 specific) and exons If 
and 7 (transcript 5 specific). Primer pairs amplifiying between exons 2 and 7 and 
• 10 exons 19 and 25, which are predicted to be present in all transcripts, were also used. 
The final two primer pairs tested for the presence of mouse transcripts 2 (mus-exon lc 
to 7) and 3 (mus-exon le to 7). These primer sequences are presented in Table 4a. 
The expression data is summarized in Table 4b. 

15 RT-PCR cloning of novel Liver transcripts: 

RT_PCR on liver cDNA using primers in exon lg and 10 resulted in the amplification 
of bands of an unexpected size. These bands were cloned into the TOPO 2.1 PCR 
cloning vector (Invitrogen) according to the manufacturers instructions. Insert 
positive clones were sequenced and the sequence compared to the original DPP10 

20 clone. A total of 3 alternate transcripts were identified (Figure 9 and 24). 

Sequence analysis by BLASTX to determine sequence homologies 

The nucleotide sequence from the combined cDNA clones was used in a BLASTX 

analysis against the Swissprot database (Altschul et al., 1997). This identified a 

25 number of closely related protein sequences from different species. The two most 
closely related proteins were the human dipeptidyl-peptidase VT (DPP6) protein 
(XPJ)04709; P42658), 71% similarity, and the human dipeptidyl-peptidase IV (DPP4) 
protein (AAA52308), 52% similarity. Given the relatedness of the chromosome 2ql4 
predicted peptide to DDP6 and DPP4, it has been named DPP10. DPP6 is also known 

30 as DPPX, and 2 different isoforms of DPPX are known to exist: DPPX-L and DPPX- 
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S. A Clustal-X alignment of DPP10, DPPX-L, DPPX-S and DPP4 is shown in Figure 
23. 

DPP10 protein sequence analysis 
5 The composite DPP10 transcript encodes a predicted protein of 796 amino acids. By 
comparison with DPP6 and DPP4, it is possible to identify a number of domains and 
residues characteristic of Class S9B serine proteases (Figure 11). In DPP10, the serine 
residue of the catalytic triad Ser-Asp-Hfis, is replaced by glycine. In addition, DPP10 
has an NH 2 -terminal cytoplasmic domain of 34 amino acids, a single transmembrane- 
10 spanning domain, a 0-propeller domain (required for catalytic activity) and 5 potential 
N-glycosylation residues. 

Human DPP10 alternative transcript characterisation 

Human 5' and 3' RACE experiments were performed from a range of tissues using 
15 commercially prepared RACE ready cDNA (Marathon Ready cDNA, Clontech) 

according to the manufacturers instructions. The nested primers for 5' RACE were 

designed within DPP10 exon 3, with a check primer in exon 2; 

DPP105* RACE.R1 TCATTGATCCACCGAGCCTCTGG; 

DPP105' RACE.R2 ACCGAGCCTCTGGATCGTGAAGC; and 
20 DPP10 5'RACE.check GCTCACTCATCACTATGTCAG. 

The nested primers for 3' RACE were designed in exons 10 and 11 with a check 
primer in exon 19; 

DPP103 , RACE.F1 AGTCTGTGAGACCACTACAGGTGC; 
25 DPP103'RACE.F2 TGAGATGACATCAGATACGTGGC; and 
DPP103'RACE.check GGAACTTATCTGTAACCAGCTGG. 

First and second round nested PCRs were run out on 1.2% agarose, blotted and 
hybridised with the internal check oligo to confirm that RACE products contained the 
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correct predicted sequence. lOOng of oligo was end labeled with y^pATP using 10 

units of T4 polynucleotide kinase (NEB) and hybridised in a phosphate/SDS buffer 
(above) at 50°C O/N. Filters were washed to 0.5X SSC/0.01%SDS at 50°C. 
Hybridising PCR products were cloned into the PCR2.1TOPO vector (Invitrogen) 
5 according to the manufacturers instructions. White colonies were picked into 125pi of 
luria broth with ampicillin and grown O/N before stamping onto nitrocellulose filters 
(Hybond N+, Amersham) and growing O/N on LB agar plates. Colony filters were 
processed using standard techniques and screened with the internal check oligo to 
identify clones containing the correct sequence. Plasmids were prepared and 
10 sequenced as above. 

In addition 5' and 3' RACE experiments were performed from within MEX4/Exon la. 
In this case no check primer was available and PCR products were directly cloned into 
PCR2.1 TOPO without prior selection by hybridisation. The nested primers used for 
15 5' RACE were MEX4RACE.R1 CTTGATTGTTTTTGAGGGTTGACAC and 
MEX4RACE.R2 GTGAGAACTCCACTTAAGGATGCC and for the 3' RACE were 
MEX4RACE.F1 AACCAAACTGCCAGCGTGTCC and MEX4RACE.F2 
GTCCCATCACATCAAGTGTCAACC. 

20 Five different transcripts were identified, designated 1 to 5 (Figure 6) containing seven 
different exons, designated la to Ig (Figure 9 where MEX4 is Exon la). Transcripts 
1-3 were isolated from brain and foetal brain cDNA and 4 and 5 were isolated from 
pancreas. The sequence of the 3' RACE clone is presented in Figure 2d. This encodes 
from exons 1 1 to 25 including 970 bp of 3 'UTR. 

25 

In addition 3' RACE was also carried out using primers rooted within exon lb. First 
round RACE was performed using DPP101bRACE.Fl 
TGCTGCCATCCGTAAATTGGAGG and second round RACE was performed with 
DPP101bRACE.F2 TGGAGCTGGTATGCTGGTTAGG. PCR products were cloned 
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directly into PCR2.1 TOPO prior to sequencing. A total of five different transcripts 
encoding five short peptide sequences were identified. The arrangement of these 
exons is summarized in the schematic in Figure 15, and the exon sequences and 
predicted peptides from the full transcripts are presented in Figures 2d and 9. 

Identification and characterisation of mouse DPP10 

BLASTN analysis using human DPP10 sequence against the mouse EST database 
identified an EST with 84% nucleotide identity, (Genbank accession number 
BE862767) to the human sequence. This insert of this clone was sequenced and used 
to design nested 5' RACE primers and an internal RACE check oligo. RACE was 
performed as described previously and the PCR products cloned into PCR2.1TOPO. 
Sequence from these clones was used to design a second set of 5' RACE primers. The 
major transcript identified encodes a 2370bp ORF with a predicted peptide sequence 
of 789 residues (Figure 8). The mouse sequence is 84% identical at the nucleotide 
15 level to the human gene. A protein:protein alignment is provided (Figure 21). 



Mouse DPP10 alternative transcript characterisation 

A total of 4 different mouse DPP10 transcripts were characterised designated 1-4, 
20 containing 5 different exons la-le (Figure 7). 

Human vs. Mouse Comparative Sequence Analysis 

Seventeen contigs generated from mouse BAC clones were extracted and formatted 
into a blast database. This database was searched with the full 462.541kb human 
25 sequence by BLASTN (NCBI BLAST 2.2.1) to determine the order of the 17 mouse 
contigs relative to the human sequence. The results-were then used to create a single 
sequence that was composed of the original contigs ordered by the Blast results. This 
sequence and the Human 25Kb sequence were masked for repeats by RepeatMasker 
version 07/07/2001. Human25Kb is a contiguous region of 25 kb human DNA 
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seqeunce encompassing MEX4. Within the sequence Human25Kb, MBX4 is located 
at nucleotides 3235 to 3295. 

The masked mouse contigs combined from 17 pieces sequence was formatted into a 
5 blast database and searched with the masked Human25Kb sequence. Fourteen 
significant HSPs (expectation e-value <1) and with greater than 80% sequence identity 
where identified, the relevant mouse sequence was extracted and formatted into a blast 
database. This database was searched by BLASTN with the masked Human25Kb 
sequence as the query sequence and results obtained (Table 5). These regions of 
10 sequence conservation may serve as regulatory domains for the DPP10 gene. 

SNP discovery in DPP10 gene 

Primer pairs were designed to amplify each exon plus some flanking intronic sequence 
from a panel of 23 control DNAs and 31 asthma patient DNAs. PCR products were 

15 subjected to mutation detection by denaturing high performance liquid 
chromatography (DHPLC) using a Transgenomic WAVE instrument (Transgenomic 
Inc.). Mutation detection temperatures were determined empirically by melting curve 
analysis. DNAs showing heteroduplex profiles were sequenced using PCR direct 
sequencing and analysed using the SEQUENCHER™ sequence analysis software, 

20 version 4.0.5b5 (Gene Codes Inc.) to identify polymorphic bases. An arginine to 
- proline amino acid substitution (C125G) was identified in Exon 10. 

TAAACATACATTTTAATTTTGTTTCCAAACTAGAGAATACTATATCACT 
ATGGTTAAATGGGTAAGCAATACCAAGACTGTGGTAAGATGGTTAAAC 
25 CGAC/GCTCAGAACATCTCCATCCTCACAGTCTGTGAGACCACTACAG 
GTGCTTGTAGTAAAGTGAGTATAATTTATTTTTCTTTTATGCCTAAAAT 
GAAGTAGCTTATGCAGCTTTACAAAGGGGAAACAGGAAATGCTTTGTA 
CAAAAAAAATTCAGTGTTTAACTTTTAAAACTAATAGGAAAAG 
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Additional SNPs within the Figure 1 sequence are presented in Tables la and lb, 
SNPs in the remainder of the DPP10 gene are in Table lc. 

SNP genotyping by Pyrosequencing™ 
5 A pair of oligonucleotides for amplification by PCR was designed on either side of 
each biallelic polymorphism to produce a product size between 50bp and 350bp. A 
sequencing oligonucleotide was designed to end within 30bp either 5' or 3' to each 
polymorphic site. All amplification oligonucleotides used to generate the 
. complementary strand to the sequencing primer were labeled with a 5' - Biotin. (see 
10 Table 7) 

For each marker, all samples genotyped were amplified by PCR using the PCR 
amplification oligonucleotides. Each reaction used: 20ng DNA (dried down), 0.6 units 

of AmpliTaq Gold™ DNA polymerase, IX PCR Buffer II, 2.5mM MgCl 2 , ImM 

f 

15 dNTP, and lOpmol of each PCR oligonucleotide in a final volume of 10|il. The PCR 
cycling conditions used were: 95°C for 12 min, 45 cycles of: 94°C for 15 sec, T A for 
15 sec (Table 2), 72°C for 30 sec, and 72°C for 5 min. 

After amplification the DNA strand of each PCR template complementary to the 
10 sequencing primer was isolated, ready for pyrosequencing (PSQ). To do this, 1) 50^1 
of Dynabead solution (2mg/ml Dynabeads®, 5mM Tris-HCl, 1M NaCl, 0.5 mM 
EDTA, 0.05% Tween 20) was added to the PCR product and shaken at 65°C for 15 
min, 2) the template was transferred using magnets to 50|Lil of 0.5M NaOH for 1 min, 
3) the template was transferred using magnets to 100]jl of IX Annealing buffer 
t5 (20mM Tris- Acetate, 5mM MgAc 2 ) for 1 min, and 4) the template was transferred 
using magnets to 45|il of IX Annealing buffer containing 15pmol of sequencing 
oligonucleotide (Table 2). 
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After template isolation, the sequencing oligonucleotide was annealed to the template 
by denaturing at 80°C for 2min and then cooling to room temperature for 10 min. 
Each marker/sample combination was then sequenced/genotyped by 
pyrosequencing™ on a PSQ96™ (Pyrosequencing AB) (Figure 26). Genotype results 
were stored in the PSQ oracle® database ready for statistical analysis. 

Cellular localisation of transcript 1 and transcript 2: 

HeLa cells were transfected with pcDNA3.1/V5-His-DPP10 (exons la to 25) or 
pcDNA3.1/V5-His-DPP10 (exon 2 to 25) using lipofectamine as transfection reagent. 
After 2 days gene expression, the cells were immuno-stained with mouse antibody 
against V5 followed with anti-mouse antibody conjugated with Alexa Fluor 546 either 
before or after fixation. For pre-fix staining, the immuno-reaction was direct 
performed on living cells using a buffer containing 1% BSA without detergent, and 
then the cells were fixed with 3% paraformaldehyde. For post-fix staining, the cells 
were fixed with 3% paraformaldehyde first and then followed by immuno-staining in 
the presence or absence of 0.1% saponin. The stained cells were observed under Leica 
DMIRE2 fluorescence microscope. 

Over-expressed full length DPP10 shows a membrane protein pattern. Post-fix 
staining revealed that the membrane structure is distributed on nuclear envelope, 
plasma membrane and in cytoplasma. This result was also confirmed by transfection - 
with a GFP tagged vector, plasmid pcDNA3.1/NT-GFP-DPP10 (exon la to 25) into 
HeLa cells. Full length DPP10 is detectable on cell surface. Positive staining was 
obtained from pre-fix immuno-reaction on full length DPP10 expressed cells, which 
suggested that a) full length DPP10 is distributed on plasma membrane; b) it is a 
transmembrane protein and accessible for antibody from outside of the cells. The C- 
terminus of foil length DPP10 is in the extracellular domain of the protein since the C- 
terminal tag V5 was detected by antibody under pre-fixation condition. 
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Over-expressed DPP10 transcript 2 exhibits cytosolic profile. It was positive only 
under post-fixation but not pre-fixation condition. 

Expression Analysis: 

RT-PCR was performed on cDNA from tissue and cell lines. The RNA from different 
cell lines and tissues was extracted with RNeasy Mini Kit (Qiagen, #74104) except 
human PBL for which PAXgene Blood RNA Kit (Qiagen, #762132) was used cDNA 
was prepared from the RNA using OMNISCRIPT Reverse Transcriptase Kit (Qiagen, 
#205111) followed by PqR with HotStar Taq PCR Kit. PCR was performed for 38 
cycles (1 min at 95C, 1 min at 54C and 1 min at 72C per cycle). A number of different 
PCR primer pairs were used. These amplified between exon la to 7 (transcript 1 
specific), exons lb to 7 (transcript 2 specific) and exons If and 7 (transcript 5 
specific). Primer pairs amplifiying between exons 2 and 7 and exonsl9 and 25, which 
are predicted to be present in all transcripts, were also used. The final two primer 
pairs tested for the presence of mouse transcripts 2 (mus-exon lc to 7) and 3 (mus- 
exon le to 7). These primer sequences are presented in Table 4a. The expression data 
is summarized in Table 4b. 

Quantitative PCR to evaluate DPP10 expression in asthmatic and control bloods: 
RNA and cDNA preparati on from Blond . A total of 7.5 mis of blood was collected 
per patient (20 mis per control) into PAXgene™ blood RNA tubes (PreAnalytiX, 
Qiagen/ BD) using a BD Safety-Lok™ blood collection set as described by the 
manufacturers. The PAXgene™ blood RNA tube was inverted 10 times and stored 
between 3 hours and 1 day at room temperature. The samples were then either 
processed or stored for up to 5 days at +4°C. The samples were prepared as described 
by the manufacturers (PreAnalytiX, Qiagen/ BD) instructions. Multiple samples from 
the same individual were pooled into a 2 ml eppendorf tube. The samples were then 
denatured at 65°C for 5 minutes in a heat block. The samples were then placed on ice 
immediately and analysed using the Agilent Technologies 2100 Bioanalyser. The 
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RNA was analysed using the RNA 6000 Nana assay kit (Agilent Technologies) and 
eukaryote total RNA Nano assay. This assay allows quantification of RNA in the 
range 25-500 ng/^il, as well as providing information regarding DNA contamination 
and RNA degradation. 

5 

Reverse transcription was performed using the EndoFree RT™ kit from Ambion. A 
mixture was prepared containing 1 \ig of RNA, 10 pMol anchored oligo(dT) (T 20 VN, 
where V = A, C, or G; N = A, C, G, or T), and 10 pMol random hexamers in a total 
volume of 8 jO. This was denatured at 70°C for 5 minutes in a heat block. This was 

10 then placed immediately on ice for 3 minutes, before being transferred to a PCR 
machine at 22°C and equilibrated for 5 minutes:. The reaction mixture containing 2 pi 
each of the following; 10X RT buffer, dCTP (2.5 mM), dTTP (2.5 mM), dGTP (2.5 
mM), dATP (2.5 mM) and 1 jliJ of RNase inhibitor (10 U/fil) was also equilibrated at 
22°C for 5 minutes. The 11 |il of reaction mixture was then added to the RNA:RT 

15 primer mix at 22°C. Subsequently 1 jxl of reverse transcriptase was added to each tube 
and mixed. The reaction was incubated in a thermal cycler at 22°C for 10 minutes 
followed by 2 hours at 49°C. 

The cDNA -was then stored at -20°C prior to quantification by the Agilent 
20 Technologies 2100 Bioanalyser. The cDNA was quantified using the Agilent 
Technologies 2100 Bioanalyser as described for the RNA (above). The only 
difference in methodology was that the chip was analysed using the eukaryote mRNA 
Nano assay. 

25 Quantitative PCR analysis of cDNA: P LATINUM Quantitative PCR SUPERMIX- 
UDG is used in all amplifications. This mix contains dUTP (instead of dTTP) and 
UDG (uracil-N-glycosylase, UNG) which removes uracil residues from single or 



WO 02/0861 




PCT/GB02/01887 



70 



double stranded DNA. Therefore, dU-containing DNA which has been digested with 
UDG (preliminary incubation of PCR reaction at 50°C for 2 minutes before cycling) is 
unable to serve as template in future PCRs, therefore preventing the reamplification of 
PCR carryover products. At high temperatures (during cycling) UDG is deactivated, 
5 therefore allowing amplification of genuine targets. 

Amplification was performed in a BioRad iCycler iQ™ Multi-Colour Real-Time 
Detection System. Assays were performed for DPP10 (exons 15 to 17) and for 
actin. PCR cycling parameters: A single cycle of 50°C for 2 minutes then a single 

10 cylcle of 95°C for 10 minutes followed by 95°C for 30 seconds and 61°C for 30 
seconds for 50 cycles. Samples were then held at 4°C. The oligos used were; P-Actin 
forward primer: TGCGTGACATTAAGGAGAAG, P-Actin reverse primer: 
GCTCGTAGCTCTTCTCCA and p-Actin Taqman probe: 

CACGGCTGCTTCCAGCTCCTC {labelled with FAM and quenched with Black 

15 Hole Quencher 1}. The final concentrations of PCR reaction components for the P- 
Actin assay were as follows; 10 ng cDNA, 125 nM of forward and reverse primer, 150 
nM probe, 0.6 units of PLATINUM Taq DNA polymerase, 20 mM Tris-HCl (pH 8.4), 
50 mM KC1, 4 mM MgCl 2 , 200 jxM dGTP, 200 fxM dATP, 200 fxM dCTP, 400 |LiM 
dUTP, 0.4 units of UDG and stabilizers. 

20 

DPP10 forward primer: TTGATGCCAGTTTTAGTCCC, DPP10 reverse primer: 
TCAGGATAGCTTCCTTCAGC and DPP10 Taqman probe: 
AGGGTCCCAGTGGTCAGCCTACATA {labelled with HEX and quenched with 
Black Hole Quencher 1}. The final concentrations of PCR reaction components for 
25 the DPP10 assay were as follows; 10 ng cDNA, 150 nM of forward and reverse 
primer, 100 nM probe, 0.6 units of PLATINUM Taq DNA polymerase, 20 mM Tris- 
HCl (pH 8.4), 50 mM KC1, 3 mM MgCl 2 , 200 mM dGTP, 200 pM dATP, 200 mM 
dCTP, 400 \)M dUTP, 0.4 units of UDG and stabilizers. 
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Assaying ofDPPIO in human serum 

Serum was isolated from four volunteers (WT1-4) using Vacuette sample tubes. Each 
serum sample was diluted (1/20, 1/40, 1/80) in sample buffer (7.5mM Tris pH6.8, 
3.8% SDS, 4M, Urea, 20% glycerol, 5% mercaptoethanol) to a total volume of 20|xl 
5 and denatured at 95°C for 5 minutes. The samples were loaded onto a 12% 

polyacrylamide SDS denaturing gel and electrophoresed 200V for 60 minutes. After 
electrophoresis the proteins were transferred to a 0.4pm nitrocellulose membrane by 
blotting at 200V for 2 hours. The filter was blocked overnight in 5% milk solution at 
4°C prior to antibody detection. 

10 

The affinity purified DPP10 C-terminal antibody was generated against a DPP10 
peptide (NH2-CLK-EEI-SVL-PQE-PEE-DE) in rabbits. The filter was incubated with 
the DPP10 Ab (1/250) in 5% milk at RT for 60 minutes. After washing, the filter was < 
incubated with anti-rabbit IgG conjugated AP (1/2000) in 5% milk at RT for 60 
15 minutes. After a final rigorous washing step, bound antibody was detected by 

chemiluminescence substrate (Roche) and autoradiography. Figure 27 shows the 
result of such a blot demonstrating the presence of DPP10 in human serum. 

Discussion 

20 

A 462kb BAC contig was constructed around the asthma associated microsatellite 
marker D2S308 and sequenced. A number of novel SNPs identified in the region were 
genotyped across a panel of asthmatic families. TDT testing of association revealed a 
number of markers with strong association to asthma. Genotype data was used to 
25 refine the extent of linkage disequilibrium (LD) around the microsatellite marker and 
the associated SNPs. A block of linkage disequilibrium containing the associated 
marker sequences was identified in which theasthma susceptibility locus was predicted 
to lie. 
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Examination of the public EST databases did not identify any clones from the region 
that contained an open reading frame (ORF), or clones that could be extended by 5' or 
3' RACE. Exon prediction was therefore carried from genomic sequence. Twenty- 
seven potential exons were identified by at least two exon prediction programs. Exons 
that were free of repeat sequences and were at least 50bp in length were amplified and 
used in pools to screen a panel of commercial (Clontech) cDNA libraries (foetal brain, 
lung, testis, trachea and skeletal muscle). 

Twenty-nine cDNA clones were identified with screening at moderate stringency. 
Twenty three of these did not contain chromosome 2 sequences. Five other clones 
consisted of contiguous genomic sequence from our region of interest: they contained 
no ORFs and were attributed to genomic DNA contamination of the cDNA libraries. 
One clone, MEX4FB-1, contained a 1301bp insert with a 1137bp ORF. The clone 
contained a 60bp exon (MEX4) that had been predicted by three programs. The full 
sequence of MEX4 was present at the 5* end of the ORF. Our exhaustive searching of 
libraries and 5' and 3* RACE experiments with all potential exons, suggests that 
MEX4FB-1 represents the only gene expressed from the region. 

BLASTN analysis against the Genbank NR database with the MEX4FB-1 sequence 
identified an overlap between the 3' end of MEX4FB-1 and the 5' end of a partial 
cDNA clone, KIAA1492. This clone extended a further 3349bp from the 3' end of 
MEX4FB-1, and the sequences together encoded a full-length cDNA. Further searches 
with the full sequence identified an additional clone (AK025075) that contained a 3' 
poly A tail upstream to that found in KIAA1492. Repeated 3' RACE experiments in 
different tissues only identified the AK025075 3' UTR termination. 

The complete 3.6Kb cDNA of the gene results in a predicted ORF of 2391bp, or 796 
residues (Figure 2c). The gene was shown to contain 25 exons by BLASTN analysis 
against the HTGS database (Altschul, S.F. et al. Gapped BLAST and PSI-BLAST: a 
new generation of protein database search programs. Nucleic Acids Res 25, 3389-402. 
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(1997)). The relative positions of (unfinished) BAC clones containing exons on 
chromosome 2 Oittp://genome.ucsc.edu/goldenPath/septTracks.html > > suggest that the 
gene spans more than 1Mb of genomic DNA. 

5 In particular the size of intron 1 is large, spanning over 500kb and containing at least 7 
alternate exon l's (a to f), therefore In-silico gene prediction programmes are unable to 
identify these exons as potential N-termini to the DPP10 gene. The RACE 
experiments in this invention establish exons la to If as real alternate N-termini of 
DPP10. Knowledge of these exons as the N-termini of the DPP10 transcripts serves 

10 three purposes in this invention. Firstly they enable prediction of the full length open 
reading frame. In the case of exons la and If the start codon is encoded within the 
exon sequence. Exons lb, lc, Id, le and lg are non-coding, but enable prediction of a 
start codon in exon 2. Three different DPP10 peptides are predicted in this invention 
from transcripts 1 to 5. Secondly, these peptide predictions indicate that both 

15 membrane bound and cytosolic forms of the protein exist, a proposal subsequently 
confirmed experimentally with cellular localization experiments using transcripts land 
2. Thirdly, the location of alternate exons la, lb and lc within the associated block of 
LD containing marker D2S308, serve to tie in the DPP10 gene with a role in the 
asthma phenotype. 

20 

Northern blots showed the gene to be expressed in a neuro-endocrine manner, with 
transcripts in brain, pancreas and adrenals (Figure 13). Lower levels of transcription 
were observed in trachea and small intestine. Multiple splice variants were visible in 
the brain and spinal cord. 

25 

Screening of mouse cDNA libraries with the gene isolated a clone (BE862767) which 
was extended to a full-length cDNA by three rounds of 5' RACE. The major transcript 
identified encodes a 2370bp ORF with a predicted peptide sequence of 789 residues. 
The mouse sequence is 84% identical at the nucleotide level to the human gene. 
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Gene homology 

The MEX4FB-1/KIAA1492 protein is a member of a family of dipeptidyl 
aminopeptidases (DPPs), so that we have named it as DPP10. The tertiary structure of 
the gene family is represented by pig prolyl oligopeptidase (Fulop et al (1998) (Figure 
2a). This enzyme consists of two domains. The first is a regulatory. -propeller that 
funnels small substrates through an internal cavity towards an active site within a 
second C-terminal.(/ hydrolase) catalytic domain. 

P-propeller blades are made up by repeat sequences, sequence similarity between the 
DPP10 and pig prolyl oligopeptidase repeats (Fulop et al (1998)) is low in the known 
3D structure, but can be mapped to the DPP10 sequence. These differences in 
sequence may provide substrate specificity by the widening or nanowing of the 
entrance to the P-propeller cavity. 

The second, catalytic, domain in pig prolyl oligopeptidase contains an active site triad: 
Ser554, Asp641 and His680. DPP10 lacks a serine from this catalytic triad, which is 
substituted by a glycine residue. 



Dipeptidyl aminopeptidase IV (DPP4) is a closer homolog to DPP10 (Misumi et al 
(1992)). It is also known as CD26 (Fleischer et al (1994)), and binds proteins 
including CD45 and adenosine deaminase (ADA) on human T lymphocytes (Kameoka 
et al (1993)). It. is also constitutively expressed on renal proximal tubular epithelial 
cells, epithelial cells in the small intestine, and biliary canaliculae (van der Velden 
(1999)). 

DPP4 has a catalytic activity that removes N-terminal dipeptides sequentially from 
polypeptides having unsubstituted N-termini, provided that the penultimate residue is 
proline. Known substrates include stromal cell derived factor-1, and macrophage 
derived chemokine (Lambeir et al (2001)). DPP4 exists in a soluble version (Durinx et 
al (2000)), suggesting the possibility of a similar 'isoform for DPP10. DPP4 forms 
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hetero-dimers with another DPP homologue, FAP which is a cell surface antigen 
selectively expressed in reactive stromal fibroblasts of epithelial cancers, granulation 
tissue of healing wounds, and malignant cells of bone and soft tissue sarcomas Scanlan 
et al (1994)). This suggests that DPP10 may also form hetero or homo-dimers. 

DPP6 is the closest homolog to DPP10. It was originally isolated from human 
hippocampus (Yokotani et al (1993)). In common with DPP10, DPP6 lacks a serine 
from its catalytic triad, due to a substitution by aspartic acid. This substitution is also 
observed in bovine and rodent DPP6 sequences. A further homologue, Drosophila 
melanogaster CG9059, also retains aspartic acid and histidine catalytic residues, but 
like DPP10, has replaced the active site serine with a glycine. The conservation of the 
catalytic histidine and aspartic acid residues in the absence of the catalytic serine in 
DPP10 and several homologues suggests an evolutionary constraint and the possible 
retention of some catalytic function. 

Although there is no evidence for another serine elsewhere in the enzymatic domain 
that could substitute for the active site serine, it is possible that the catalytic serine 
might be provided by the substrate (Dall'Acqua et al (2000)). As DPP cleavage takes 
place only at sites where a penultimate residue is proline, catalytic serines may be 
provided by substrates which contain a PxS motif with a serine at + 2 after the proline 
at the cleavage point We therefore searched for PxS motifs amongst a redundant list of 
approximately 1000 human cytokine amino acid sequences from the Entrez database 
(http://www.ncbi.nlm.nih. gov/entrezV The sequences were filtered using a perl script 
and the sigcleave module from Bioperl (http://bioperl.orgV All sequences with a signal 
peptide that was 20 amino acids or fewer from the N-terminus and a PxS tripeptide 
starting at the +2 position were identified. As a control, an identical protocol was used 
to detect cytokines with an SxP tripeptide at the +2 position, and and none were found. 
Amongst the sequences containing the PxS motif at the +2 position are several key 
inflammatory cytokines and chemokines, including SDF-1, EP10, Eotaxin and 
RANTES (Figure 19). A mechanism is thus suggested for DPP10 to modulate 
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asthmatic airway and other inflammation by activation of these cytokines. The 
structure of the DPP10 protein further suggests that small molecules may inhibit its 
enzymatic activity, so that pharmaceutical targeting of DPP10 will provide a novel 
means of inhibiting asthmatic airway and other inflammation. 

5 

Alternative splicing and gene expression 

5' RACE was performed using primers within exon HI of DPP10. Five different N- 
termini were identified. The corresponding cDNAs were designated 1 to 5. These 
contained seven different exons, designated la to lg (Figure 6). Transcripts 1-3 were 

0 isolated from brain and foetal brain cDNA and 4 and 5 were isolated from pancreas. 
Exons la and If (transcripts 1 and 4) provide alternate N-terminal coding sequences, 
23 residues upstream of the start of exon 2. The other N-terminal exons are non- 
coding and the next available initiation codon is within exon 2. Therefore three 
predicted proteins are encoded by these five transcripts. Two of these, initiated within 

5 exon la and exon If, contain a trans-membrane region, suggesting that the majority of 
the protein will be located in the cytosol. 

3' RACE from exon la identified an alternate exon 2 (2a), which is 6655bp 
downstream of the la exon (Figure 6). This transcript was identified in cDNA from 

1 brain and testis, and. encodes a 47 residue peptide. Alternative splicing between this 
"stopper" exon and exon 2 and the rest of the gene offers a potential mechanism for 
regulating the membrane expression of the complete gene product. 

We then positioned these alternate early exons on the LD map of the region. 
Associations with the total serum IgE level mapped to the A island of LD, close to 
exon la. The two markers showing the strongest association to asthma were located at 
the beginning of the B LD island, in close proximity to the 2a stopper exon. Weaker 
associations to asthma were seen near exons lb and lc. Other exons are outside of the 
region of association. 
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No coding polymorphisms were found in any exons, and we typed all SNPs in 
<100Kb proximity to any coding sequences. Genetic effects at this locus may 
therefore be attributed to the actions of polymorphism on alternative splicing between 
membrane bound and other forms of the protein. Similar effects have been observed 
5 with the mapping of distinct asthma-associated traits to individual polymorphisms that 
affect splicing of the IL4-receptor gene (Ober et al (2000) and Kruse et al (1999) and 
Kruseetal(1999). 

Protease assay 

10 DPP10, either purified from expression systems or from cell lystes is incubated with 
peptides to be tested as putative substrates e.g. chemokines such as RANTES, eotaxin. 
At varying time intervals, samples are withdrawn and quenched in TCA. The samples 
are then desalted, eluted and the composition of the mixture identified with a mass 
spectrometer. An example of such a procedure can be found in Lambeir et al. 2001 J. 

15 Biol Chem 276, 29389-29845. 

Cell adhesion assay 

To address whether full length DPP10 protein is involved in cell adhesion, cell-cell 
adhesion assay (Gee B. E. and Piatt O. S. 1995 Sickle Reticulocyes adhere to VCAM- 

20 1. Blood 85(l):268-274.) and Rosetting assay (DeRose V. et al 1994 Substance P 
increases neutrophil adhesion to bronchiaL epithelial cells. /. Immunol 152(3): 1339- 
1346; Walsh G. M. et al. 1991 Human eosinophil, but not neutrophil, adherence to IL- 
1 -stimulated human umbilical vascular endothelial cells is a401 (very late antigen-4) 
dependent. /. Immunol 146(10):3419-3423) is performed, In brief, HeLa or COS cells 

25 are transfected with pcDNA3.1/V5-ffis-DPP10(full) using Lipofectamine as 
transfection reagent. After 2 days gene expression, for cell-cell adhesion assay, the 
cells are be incubated with leukocytes with occasional rotation. Unbound cells are 
washed away before fixation, and then followed by immunostaining. The cell binding 
is determined under microscope. For Rosetting assay, after gene expression, the cells 

30 are detached by trypsine/EDTA. The cell suspension are mixed with leukocytes. After 
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incubation, the cells are fixed onto glass slides followed by staining. The rosette is 
scored under microscope. 



Cellular signalling/activation assays (for example Lymphocyte immunoreaction). To 
understand how DPP10 protein is directly involved in lymphocyte immunoresponse, 
following points are addressed: 

1) Is DPP10 level regulated by the physiological condition of lymphocytes? Primary 
T- and B-cells which have been confirmed to be DPP10 positive, and DPP10 
positive T-cell lines are immunostimulated and used for this study. DPP10 level is 
compared between resting and active form of these cells with quantitative rtPCR. 

2) Does DPP10 expression affect lymphocyte immunoresponse? DPP10 is 
overexpressed in DPP10 negative T-cell (may be also B-cell) lines by transfection 
with pcDNA3.1/V5-His-DPP10(full) or pcDNA3.1/V5-His-DPP10(Tpt2). After 
gene expression, the cells are immunostimulated. The intensity of the 
immunoresponse is compared between DPP10 overexpressed cells and control 
cells using cytokine assay or proliferation assay. 

3) How is DPP10 involved in lymphocyte immunoresponse? Lymphocytes are 
transfected- with pcDNA3.1/V5-His-DPP10(full) or pcDNA3.1/V5-His- 
DPP10(Tpt2). The effect of DPP10 overexpression on different signal 
transduction pathways, such as P1-3K, ERK, Jak/Stat and NFkB pathways, is 
noted with Western blot, in vitro assays. 
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1. An isolated nucleic acid sequence comprising a DPP10 mRNA sequence. 

2. An isolated nucleic acid sequence according to claim 1 where the sequence 
encodes a human DPP10, or a sequence complementary or substantially homologous 
thereto, or a fragment thereof. 

3. An isolated nucleic acid sequence according to claim 1 where the sequence 
encodes a mouse DPP10, or a sequence complementary or substantially homologous 
thereto, or a fragment thereof. 



4. An isolated nucleic acid sequence according to any of claims 1 to 3 comprising 
one or more exons of DPP10, or a sequence complementary or substantially 

15 homologous thereto, or a fragment thereof. 

5. Use of a sequence of any of claims 1 to 4 for regulating DPP10 expression. 

6. Use of the sequence of any of claims 1 to 4 for the manufacture of a 
20 medicament for the regulation of DPP10 expression. 

7. A vector comprising the isolated nucleic acid sequence of any of claims 1 to 4 
to enable in vitro or in vivo expression of DPP 10. 

25 8. A polypeptide sequence encoded by the isolated nucleic acid of any of claims 1 
to 4 or a sequence substantially homologous thereto, or a fragment thereof. 



9. ^ A polypeptide sequence according to claim 8 where the polypeptide is a 
soluble DPP10 protein lacking a transmembrane domain. 
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10. A soluble DPP10 protein according to either claims 8 or 9 which is operably 
linked to a secretion signal, 

11. A soluble DPP10 protein according to claim 10 where the protein comprises a 
5 Histidine tag. 

12. A fusion protein comprising the polypeptide of claims 8 or 9 or the soluble 
protein of claims 10 or 1 1 where the fusion protein or protein is linked to a carrier. 

10 13. A polypeptide sequence according to claims 8 or 9, or protein according to any 
of claims 10 to 12 where the protein is post-translationally modified. 

■ 14. An antibody specific for the polypeptide sequence of claims 8 to 13 or the 
isolated nucleic acid of claims 1 to 4. 

15 

15. An antibody which reacts with an antigen of a polypeptide according to claims 
8, 9 or 13 or protein according to any of claims 10 to 13. 

16. An antibody according to claim 14 which is specific for the soluble form 
20 of DPP10, the 0-propeller domain, the external domain or the catalytic domain. 

17. An antibody according to claim 15 which reacts with the soluble form of 
DPP10, the P-propeller domain the external domain or the catalylic domain. 

25 18. An antibody according to claim 16 or 17 where the antibody is a chimeric 
antibody or is humanised. 

19. Use of the antibody of any of claims 14 to 18 in an assay for detecting or 
measuring DPP10 in a sample. 
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20. A process for the preparation of a nucleic acid sequence according to any of 
claims 1 to 4 comprising ligating together successive nucleotide and/or 
oligonucleotide residues. 

5 .21. A process for the preparation of a polypeptide according to claims 8 to 13 
comprising ligating together successive amino acids and/or oligopeptides. 

22. A process according to claim 21 where the polypeptide or protein is produced 
in a cell free system. 

10 

23. A transgenic non-human animal comprising the vector of claim 7. 

24. A transgenic non-human animal that does not substantially express DPP10. 

15 25. A transgenic non-human animal that encodes a variant of DPP10 which results 
in disease. 

26. A method of diagnosing, or determining susceptibility of a subject to 
inflammatory disease comprising determining the presence of a variant of DPP10 

20 which is associated with a disease state, or measuring the level of DPP10, in a sample. 

27. A method for diagnosing disease or predisposition to DPP10 related disease, 
comprising determining the presence or absence of a risk allele of a SNP at position 
259007, 267901 and/or 318524 of Figure 1, wherein presence of the risk allele is 

25 diagnostic of disease or predisposition to disease. 

28. A method according to claim 27 wherein the risk alleles are any nucleotide 
residue other than adenine at position 259007; any nucleotide residue other than 
adenine at position 267901 and any nucleotide residue other than thymine at position 

30 318524 of Figure 1, 
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29. A method according to claims 27 or 28 wherein the risk alleles are a cytosine 
residue at position 259007; a guanine residue at position 267901 and a cytosine 
residue at position 3 1 8524 of Figure 1 . 

5 

30. A method accordingly to any of claims 27 to 29, further comprising 
determining the presence or absence of a risk allele of one or more of the SNPs of 
Table la, lb, lc or Table 3. 

10 31. A method according to any of claims 27 to 30, wherein the method is 
performed on a sample. 

32. A method according to any of claims 27 to 31 comprising removing a sample 
from a subject, and isolating nucleic acid therefrom. 

15 

33 . A method of preventing or treating disease in a subject wherein 

the method comprises modulating the activity, expression, half life or post 
translational modification of DPP10. 

20 34. A method according to claim 33 where the disease is inflammatory bowel 
disease, asthma, atopy, rheumatoid arthritis or psoriasis. 

35. A method of treating or preventing disease according to claim 33 or claim 34 
comprising determining the presence or absence of a risk allele of a SNP at position 

25 259007, 267901 and/or 318524 of Figure 1; and if a risk allele is present, 
administering treatment in order to prevent, delay or reduce the disease. 

36. A method according to any of claims 33 to 35 where the method comprises 

30 administration to a subject of an agent capable of modulating the effects of the 
disease-causing allele. 
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37. A method according to any of claims 33 to 35 where the disease is 
inflammatory disease, such as inflammatory.bowel disease, asthma, atopy, rheumatoid 
arthritis or psoriasis. 

5 

38. An isolated nucleic acid molecule comprising a SNP in a DPP10 nucleic acid 
molecule. 

39. An isolated nucleic acid molecule as claimed in claim 38 comprising part of a 
10 sequence of Figure 1, and comprising one or more SNPs at positions which correspond 

to the positions of Figure 1 listed in any one or more of Tables la, lb, Ic or 3. 

40. An isolated nucleic acid molecule comprising a SNP at the position 
corresponding to position 318524 of Figure 1, or at the position corresponding to 

15 position 259007 of Figure 1, or at the position corresponding to position 267901 of 
Figure 1. 

41. An isolated nucleic acid molecule which hybridizes under stringent conditions 
to a sequence of any one of claims 38 to 40. 

20 

42. An isolated nucleic acid molecule according to claim 41, which is capable of 
distinguishing between alleles of a SNP of Table 1 . 

43. A primer sequence as described in Table 2. 

25 

44. A vector comprising an isolated nucleic acid molecule of any one of claims 38 
to 43. 

45. A host cell comprising a vector of claim 44 or isolated nucleic acid molecule of 
30 any one of claims 38 to 43. 
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46. A polypeptide sequence encoded by the isolated nucleic acid of any of claims 
38 to 42 or a sequence substantially homologous thereto, or a fragment thereof. 

47. A polypeptide sequence according to claim 46 where the polypeptide is a 
soluble DPP10 protein lacking a transmembrane domain. 

48. A soluble DPP10 protein according to either claims 46 or 47 which is operably 
linked to a secretion signal. 



49. A soluble DPP10 protein according to claim 48 where the protein comprises a 
Histidine tag. 

50. A fusion protein comprising the polypeptide of 46 or 47 claims or the soluble 
protein of claims 48 or 49 where the polypeptide or fusion protein is linked to a 
carrier. 

51. A polypeptide sequence according to claims 46 or 47, or protein according to 
any of claims 48 to 50 where the protein is post-translationally modified. 

52. An antibody specific for the isolated nucleic acid of claims 38 to 42 of the 
polypeptide sequence of claims 46, 47 or 51; or protein of claims 48 to 51. 

53. An antibody which reacts with an antigen of a polypeptide according to claims 
46, 47 or 51 or protein according to any of claims 48 to 57. 



10 



54. An antibody according to claim 52 or 53 where the antibody is a chimeric 
antibody or humanised or bifunctional. 
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55. Use of the, antibody of any of claims 52 to 54 in an assay for detecting or 
measuring a DPP10 polymorphism in a sample. 

56. A host cell comprising the vector of claim 7 or claim 44 for producing 

5 recombinant DPP10 gene products, or for use in the regulation or analysis of DPP10. 

57. A host cell comprising the vector of claim 7 or claim 44 for producing 
recombinant DPP10 gene products, or for use in drug screening systems to identify 
agents for use in diagnosis or treatment of individuals having or being susceptible to 

10 inflammatory disease. 

58. Use of the host cell of claim 57 for use in drug screening systems to identify 
agents for use in diagnosis or treatment of individuals having or being susceptible to 
inflammatory disease. 

15 

59. A transgenic non-human animal comprising a vector of claim 7 or claim 44 or 
isolated nucleic acid molecule of any one of claims 38 to 42. 

60. A kit for diagnosis of disease or predisposition to disease, comprising a means 
20 for determining the presence or absence of a risk allele of a SNP of Table la, lb, lc or 

Table 3, wherein the risk allele is diagnostic of disease or predisposition to disease. 

61 . A kit accordingly to claim 60, comprising means for determining the presence 
or absence of a risk allele of a SNP at position 259007, position 267901, and/or 

25 position 3 1 8524 of Figure 1. 

62. A method of identifying a compound for treatment of disease, comprising (a) 
administration of a compound to tissue comprising an isolated nucleic acid molecule 
comprising a SNP at a position listed in Table la, lb, lc or Table 3; and (b) 

30 determining whether the compound modulates downstream effects of the SNP. 
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63, An agent or antibody for use in preventing or treating inflammatory disease 
such as inflammatory bowel disease, asthma, atopy, rheumatoid arthritis or psoriasis. 

5 64. Use of an agent in the manufacture of a medicament for use in preventing or 
treating inflammatory disease such as inflammatory bowel disease, asthma, atopy, 
rheumatoid arthritis or psoriasis. 



10 



15 



65. A pharmaceutical composition comprising a nucleic acid according to any of 
claims 1 to 4 or 38 to 42 or a polypeptide according to any of claims 8 to 13 or 46 to 
51. 

66. A pharmaceutical composition comprising an antibody according to any of 
claims 14 to 18, 52 or 53. 

67. A screen for identifying an agent which modulates DPP10. activity comprising: . 



providing a DPP10 polypeptide sequence as claimed in any one of claims 8 to 13; 
providing a DPP10 substrate; 
20 providing an agent to be tested; 

measuring whether the agent to be tested modulates DPP10 by measuring processing 
of the DPP10 substrate. 

68. A screen according to claim 66 where the substrate is a molecule having the 
25 XPXS motif, a molecule having the generic formula of NH 2 X-P(X) y S-(X) n , or a 
chemokine. 



30 



69. A substrate according to any one of claims 67 to 68 where the XPXS motif or 
the generic formula NH 2 X-P(X) y S-(X) n is present in a small peptide molecule. 
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70. A substrate according to any one of claims 66 to 69 where the substrate is 
fluorescently labelled. 

71 . A screen according to any one of claims 1 to 4 where the DPP10 polypeptide is 
5 purified. 

72. A screen according to any one of claims 67 to 71 where the measurement of 
the processing of the substrate comprises measuring protease activity. 

10 73. A screen according to any one of claims 67 to 72 where the DPP10 polypeptide 
is expressed by a cell. 

74. A screen according to any one of claims 67 to 73 where the agent to be tested 
is anon-biological molecule or a biological molecule. 

15 

75. A screen for identifying an agent which modulates DPP10 activity comprising: 

providing a DPP10 polypeptide as claimed in any one of claims 8 to 13; 

providing an agent to be tested; 
20 providing a cell; and 

-measuring whether the agent to be tested modulates DPP10 by measuring adhesion of 
_the cell to a surface. 

76. A screen according to claim 75 where the surface is the surface of a further 
25 cell. . - 

77. A screen according to any one of claims 75 or 76 where the surface comprises 
a non-biological molecule. 
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78. A screen according to any one of claims 75 to 77 where the surface is a 
biological molecule. 

79. A screen according to any one of claims 75 to 78 where one or more of the 
5 cells are immobilized. 

80. A screen according to any of claims 75 to 79 where one or more of the cells 
are a lymphocyte. 

10 81. A screen according to any of claims 75 to 80 where one or more of the cells is 
a cell transfected of the vector of claim 7 or claim 43 or is the host cell of claim 44 or 
45. 

82. A screen for identifying an agent which modulates DPP10 activity comprising: 

15 

providing a DPP10 polypeptide as claimed in any one of claims 8 to 13; 
providing an agent to be tested; 
providing a cell; 

measuring a change in differentiation or proliferation of the cell. 

20 

83. A screen according to claim 82 where the cell is expressing DPP10, as claimed 
in any one of claims 8 to 13. 

84. A screen according to any one of claims 82 or 83 where the cell whose 
25 differentiation is measured is a T-lymphocyte. 

85. A screen according to any one of claims 82 to 84 where the change in cellular 
differentiation is T-cell activation. 
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86. . A screen according to any of claims 82 to 84 where the change in cellular 
differentiation involves a change in expression of a cell signalling factor. 

87. A screen according to claim 86 where the cell signalling factor is an 
5 immunomodulator or a peptide regulatory factor. 

88. A screen according to any of claims 82 to 87 where the cell is cultured 
following removal from a patient or experimental animal. 

10 89. A screen for identifying an agent which modulates DPP10 activity comprising: 

providing a transgenic animal according any one of claims 23 to 25 or 59; 
providing an agent to be tested; 

contacting the transgenic animal with the agent to be tested; 
15 detecting a change in the transgenic animals phenotype. 

90. A screen according to claim 89 where the change in phenotype involves a 
change in T-cell phenotype. 

20 91. A screen according to claim 89 where the change in phenotype involves a 
change in B-cell phenotype. 

92. A screen for detecting a side effect associated with the use of an agent which 
modulates DPP10 comprising: 

25 

providing a cell which does not substantially express DPP10; 
providing an agent to be tested; 
contacting the agent to be tested with with the cell; and 
measuring any side effect produced by the agent on the cell. 



T 
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93. A screen according to claim 92 where the side effect involves a change in cell 
differentiation. 

94. A screen according to claim 92 where the side effect involves a change in cell 
5 proliferation. 



95. A screen according to any one of claims 92 to 94 where the cell is part of a 
transgenic animal. 

96. A screen according to any one of claims 92 to 95 where the side effect is a 
measure of the change of phenotype. 

97. A screen for identifying an agent which modulates DPP10 activity comprising: 

providing a DPP10 nucleic acid according to any of claims 1 to 4; 
providing an agent to be tested; 

measuring whether the agent to be tested modulates DPP10 by measuring the ' 
interaction of the agent with the sample of nucleic acid. 

98. A screen according to claim 97 where the screen is an in vitro transcription 
assay measuring transcription of DPP10. 



25 



99. Use of a nucleic acid sequence according to any of claims 1 to 4 or 38 to 42 or 
a polypeptide sequence according to any of claims 8 to 13 or 46 to 51 in a screen for 
an agent which modulates the activity of DPP10. 
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Figure 1. 

1 GGCAAAACAA ACGAAATCAC TCATCTTCCC CCAGCATGCC TTGCTATCTC 
51 ATGACTCACT GTGTTGTCAT CTGTGCCTCA GGAGGCCGAG GTGCTCCCAT 
101 CCTCCCCACA GCCTTAGCTC CTGCTCCCTC CTGGGGGACC CTTGCATACA 
151 TTAAGGGCTT GTGAAGGCTG ATTGTCTTTG GGGGTTATGT CTTCTCTCTT 
201 CTATATACCC GTGGCACCTT GTATTTTATG ACATTTGTCA AGTTTTAAAA 
251 AATTGTTTAT CTGTTTTCTT CCTAATTGTG AGCCCTTGAG GACAGGGGCC 
301 TTGTCTTCTC TTTCTCATGC TATGCTAGCT CAGGGTTCAC CTCACCATTT 
351 GTGCTTCTTC AATACCCGTA GGCTAAGTTA ATGGCTGCGT TCAATTTTTT 
401 TTCCCAACCA GTTACATCTC TGCTCGGACC CAAGATAGTT TACTCAGTGA 
451 AGAGGATATG CTTCTGCATT TTTGCAGCTC TTCTCTTAAG TTACCTGAAA 
501 ATACCACCAG GAGAAGCTTA CGACAACCTT GCCATTGCTG TTGTCACCTC 
551 CAGGTAGGGT GTCCCTTCAT TGCTGCTGTC ATTTTGCCAG ACTTAACCTT 
601 TATAAGAAAA CACATGCTAA CCTCTATCAA TCTTCAGCTT ATTTCTTTAA 
651 TCTACCTGCA CTGGTCACAT TTGCTCATGA AAGTACAGAA AGAATAATTA 
701 GTTTTTTATG GGTCTTAGAG TTATGGAAAT GTTGAGCTTC ATGGTACCTT 
751 AGAAATTAAT GAGAATCAAC ATTTCATCCA TGGATCAGGA AACTGTGTGA 
801 CCCAAGAGAA GGGAAGCGGC TTCTCAGGGT CACACAGTGG GTCAGTGATA 
851 GAGCTAGAAT TAGTGCCTAG CGCTTCAGAT GCATTATTAG TGCTTTTTCT 
901 AATTTTTGTT AGTGCATTTC ACTCACACTC ACACACACAC ACACGGTTTT 
951 . CCAGCTATAG AATTCCTTCT AATGAAATAG AAAATTGCAC AAAAGTTTAA 
1001 GTACTTAGTG CTAAGATATT TGAATTTATA AAGCTCTTTG AATTTTTGGA 
1051 GGTCTACGGG TGCTCCAACA GGAATTTTGT TAGAAGGTAT AACATTAGCA 
1101 ATGTTAATGA GGGAAATTCT TCATTATGTG GGACTGTCTG GAGCCTTGCA 
1151 GGCCATTTGG CAGTCTTGCC ACCCCCAAAT CCCAAGATAT TGTGACACCC 
1201 CCACCCCTAT GCTTGCTTAC ATTTCCAAAC ACTCTTGATA GTGGCATTAT 
1251 TGCCCAATTG AGAATCACTT TGTGATTCTC AGAGCAAAAG AGGTATAGAG 
1301 GGGTTGATTT ATTTGATTTG CTTGATAAGT AAGTTTCTGC CTGGAACAAT 
1351 GAGTGAACTC AGCTAAATCC ACAGACAGGG CTGTTAGTAA CACTGGGACT 
1401 GGGCTGATTC TTGCAACCAC TGGTTGTATG CTCTTTTCAT CAGATAATTG 
1451 ACTTATTTCT AACTTAGTCC ATACTATGCC CCACTCAATG ATGTCCATGC 
1501 TTCACCAGGG ATTGCTAATA GGATGGAGAG ATGGGAGGGA AAGAGCAAGG 
1551 AGCTGGGGCT CAGAGAAGGA GAACACATGG AAGAGGACTC ATGAGGCAGC 
1601 AAAACTCCAC AGTCATTCAA AACACACCTG CAGGAATCTT TCCTGCTGCT 
1651 AAAATTCCCC CAAGGTCTGA AAAGGATGGA ACACACCAAT TTGAGTTACA 
1701 AAAATCTTTA AAAATGCACG TCGGGTACAC CTCATCACTG CCAAGCAATA 
1751 GCGAATGCAG TGATGTGGCT TCTGCAGTAC TTTGCTCTGT TAAAGTTTAT 
1801 CTTCTTAAAA ATTTGTCTTC CGAGTTGGCT TATAAATGCC AATTATTTCC 
1851 ATGGGGATGG AAACCAAGGC ATCCCTTTTA GGTGGGAATG TGGAGAGGTC 
1901 ATGATTATTT AAACAGAGAA TTTTGTGGAG AGCTGGCATA AGGCTTGAGA 
1951 AGTGTTGTTA TTGCTCAAAG AACCTGCAAA CTGCAGGGAG AATCTGGATC 
2001 TGTGCAGGAT AACAGATAAT AGAGCTTTTT GGTCCTGACT GATTTACTTG 
' 2051 AAAGGAAAGG GGAGGGACAG GATGCTGGCT GCAGGGCTGG ATGGCTTGGG 
2101 GAGGGCACAG GGTGATGGGA GACAGGAATG GGGGGTTGCT CTTGCTCTGT 
2151 CACCAGGCTC CATACATAAA AGCTTCCTGT ATCATGAGCT TATTTCCAAC 
2201 ATCTCAATTG AGTCAAGACC TACGGCATGA GTAAGGCCAC AGAAGTGGAG 
2251 TTATGTTCCA CTTTCTTGAG AGTGAAGTAT ACATGTAAAT TATTAGAAAT 
2301 CGGTCTGCAC GGTAGATTTG TCTATTCTTA CCATTAATCA TATAAATTTA 
2351 TATTTGACTT AATGAATTAA TATTTATATT AAATTTACTA GTTAATATTT 
2401 ATAAGATCTT AGTATCTATA AGATCTTAAA TGAATTATTT TCCCAATTAA 
2451 TTAAATCATA CTGTACAGAC TCATTATATT TACTTTATAC TTCAGGTAAT 
2501 AATTCAATAC TACTTATTTT GTTGTTCTTT CTGTTGGCTT ATGTGTCCTT 
2551 TGTATCCCTC CATTACTGTG CTTTTTTAAA AAAAAATTTT CAGAACTCCT 
2 601 TTATATTCTG GAACTATGAA TTGCTGCAGG TTCATCTTAC ATATTTCCTG 
2651 ACCCAGTCCT AGAGTCATTT CTCCTAGGAG CCCATGTCCT TTTATTGGAG 
2701 GATGATATTA GAAGCCAAGA TTTGGATGTT ACGTGTGTTC TGCCACTGGG 
2751 ATTTTGTTGC TTCTAGGCTA CCTTGGCTGA CAGAAAAAAG AAATATATGT 
2801 GTGTATACTA AACTGTTTAT ATACACATAT CTATAAATAT CTCTAATGTA 
2851 ACCATCTGTA TTTATATTAA ATATAATATG AATTTGTGCT AATGTTTCTC 
2901 ACTCTAATCT ATTACACATA CACCATTCTA GCCTTCTCTC CTTGCATACT 



WO 02/086113^p M PCT/GB02/01887 

2/219 

2951 GCAACCTTCC ACTTCAACAG TGAAAAATCT GGCCCTCACC ATCTTCCATT 
3001 TCATTTACCT AGTCATTCAA TTCCAGCATA CATGAATAGT AGTTTCAGAA 
3051 GTGTTAACCT GTACCTCCAG GGGAAATCAA TTTATCATCG AGAGGACAGT 
3101 GTTTATGTGC AGCTCCTTTT GCCTTAGTCT TACATGCTCC ACTCATTTCC 
3151 AAAGTTACAT AGGTCAGCAC CTTTTCCTCA GTCCCTTTCA GTGAGATTGT 
3201 TTCATACACT TGTTACACAG TTTGAGTTTT AGGTCGCATC CTGCATTCCA 
3251 TCATTGCATT CCTCAACATC CTAAATTATG ATATTTTCTC TTGCATACAT 
3301 TAAAGTTCCC TCATGCCTTA AAATTCTATT GGTTTTGATA AATGCATAAT 
3351 GTCATATGTT CACAATTACA GTATTACACA GAATAGTTTC ACCCCCCTAA 
3401 AAAAACTTCC TGTGCTTCAC TTATTCAACC CTATCTCCCC AAATCCTTGG 
3451 CAACAACTGA TCTGTTTACC ATCTTCATAG TTTTGACTTT TGTAGAATGT 
3501 CGTATAAAAG GAACTATAGA GTATACAGCC TTTCCAACTG GCATCTCTTA 
3551 CTTAGCAGTA TGCATTTAAG ATTCATCCAA GCATTCGTGT GGCTTGATAG 
3601 CTTATTCTTT TTTTAATGCT GAATGGTATT CCTTTGTATA TATGTACCAT 
3651 AGTTTGTTTA TGTATTTACC TATTGAAAGG AATCTTGGTT GCTTCCATTT 
3.701 TGGGGAATTA CAGATAAAGC TGTTATAAAC ATTCATGTGC TCTGTGTGTG 
3751 TGTGTGTGTG TCCCTAAGTT TTTAAATCAG TTGAGTAAAT ACCAAGAAGT 
3801 GCAATTGCTG GATTGTATGG CAAGTCTATG TTTAGCTTTG TAAGAAACTG 
3851 AAAAAACTGT CTTTCAAATT GGCTGTACCA TTTTGCATTC CCACTACCAG 
3901 TGAATGAGAG CTCCTATGTA TCTGCAGCCT TGCCAGAAAT TAGTATTGTC 
3951 GGATAGTTCC ATTTTAGACA TTCTAGTAAG TACGTGGTAG CATCTCACTC 
4001 TTGTTTATAT TAACATTTCC CTAATGACAA CCGATGTTGA GCATATTTTT 
4051 ACATGAAATC TCACACTTTT TTAAACTATA CATTCTTACT AAGAGTTGAA 
4101 GAAAGATCTC TGGCCAGCTG GCTCAGCTAC ATCCAGCATG GGTGATATCT 
4151 GGACATTCAA ATAAGTCTCT GAAGTCTGGA ACAGTTGATA AGGATGTATT 
4201 TCAGAAGCTG GAAACATGAA GGCATCAGGG CAGGTGGTTG GGGTAGTTGC 
4251 ACCATGCTGA ATTAAGAGTC AAGAATCAGG TCTAACAGGT AGGCACCCAT 
4301 TTAGGGCAAT GTCCTGACTG TTTTTCTCAC ATCACATACA TTTTCCATAT 
4351 CCTCCTTCTC CATTTCTACA TATATATATA TATGTATGTA TGTGTGTGCG 
4401 TGTATATATA TGTACATATA TATATATATA TGTATATATA TGTACATATA 
4451 TATATATATA TATATATATA TATATATATA TATATATATC TGTGAAGCCA 
4501 TCATCCTGAT CAAATAATGA ATTCATCACT TCCAAAAATT TCCTCATGCC 
4551 CTTTTATTGC ATTTCCCTCC TACACTTTCC CTGCTTCTTG GCCTAGGCAA 
4601 ACATTTATTT GCTTTCTGTC ACTGTAGATT AGTTTGTATT TTCTGGAGTC 
4651 TTATACAAAT GGAAGTAAGC AGTATGTATT TTTTTCTTAC TTCCTTTACT 
4701 CAGCATCATC AATTTGAAGA TAAGCCATAT TTTTGCATGT ATCAATGGTT 
4751 CATTATTTTT TGTTGCTGAA TAGTATTCCA TTGTATGGGT TATACTATAA 
4801 TTTGTTTATC TCTTCCCTTA CTGATGGACA TTTGGTCTAT TCTAAATAAA 
4851 GCTTCTATGA AAATCATGTA AATTAATATA TGCTCTCATT TCTCTAGCTA 
4901 TGTGCAAATA CTTAGAAGTA GAAGGCTGAA TCATATGATG GATATGTATA 
4951 TAACCTTTAA GAAACTGCCA AACTGTTTTC CAAAGTGATT* GTACCATTTT 
5001 ACATCCCACC AGAAGTATAT AAGACTTTCT TTTTTTTTAC ATTCTTGCCA 
5051 AAGCTTAAAA TAATTGTTTG CTTTAGATTT TAGTCATTAT GACAGTTGTA 
5101 TAGTGGTATC TCATTGTGAC TATAATGTTG AGCGTCTTGT CATATTTATT 
5151 TGCCATCCAT ATATCTTTTT CAGTAAAATG TTTGCATAAA CTTTTTGTCC 
5201 ATTAAAAAAA AATGGATTGT TTCATTCCTT ATAAATGAGT TTTGAAAGTT 
5251 TTTTTTTTAT GTTGTGGATT CAAGCTGTTT ATCACAGATA TGATTTTTTC 
5301 TTTTTTTTTT TTAAGATGGA GTTTTGATCT TGTTGACCAG GCTGGAGTGC 
5351 AATGGCATGA TCTCAGCTCA CTGCACCCTC CCACCTCCCA GGTTCAAGAG 
5401 ATTCTCCTGC CTCAGCCTCC TGAGTAGCTG GGATTACAGG CATGCACCAC 
5451 CATGCCTGGC TAATTTTTGT ATTTTTAGTA GAGACGAGGT TTCTCCATGT 
5501 TGGTCTGGCT GGTCTCGAAC TCCTGACCTC AGGTTATCCG CCCGCTTCGG 
5551 CCTCCGAAAG TGCTGAGATT ACAGGTGTGA ACCAACGCAC CCAGCCTATA 
5601 TGATTTTTAA. GTATTTTCTC TCAGTCTATG GTTTGTTTTT TTGATTATGT 
5651 TAACAGAATC CTTTGAAAAG AGAAGTTTGA AATTTTGGTG AAGTCAAATT 
5701 TATAAATTTT TTTTATGGAT TTTTTTTATG CTTTTGTTGA TTTATCTAAG 
5751 AAAACTTTGC CTAACCCAAA GCCACAAATA ATTTCTTTTA TATTTTATTC 
5801 TAGAAGTACT GTGATTTTAA GTATTACAGT TAGGTCCACA ATTTATTTCA 
5851 AATTAATTTT TATATATGGT GCAAGGTATC GATCATAATT TTTTTTTGTA 
5901 CATGGCCATC TAATTGTTCT ACCACCGTTT GCTGAGTAGG CTATTCTTTT 
5951 GCTATTAACT TGTCTTTTCG TGTTTGTCAA AATCAGTTGT" CCATATACAT 
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6001 GTAGATATAA TTCTGGGCTA TTCTATTCCA CTGAGCCATG GAAAAAGCCA 
6051 TAGACAAATC CTTATGCCAA TCTCATAATT TTTATTTTTG AAGCTTTGTA 
6101 ATAAATTTTG AAATCAAATA TCATTTTCTC TTCAACTTTA TTCCTTTCTG 
6151 TATCTCTCTT TTAGTGTGTA CCCACTATAT TGCCATCACT TATTGACACA 
6201 ACTCTTTAAC CTAGTAGGCC AAGAGCCCTT TGTAGGTGGG AAGCATGACT 
6251 TACTCATTTT TTAAAACATA AATCAGACCA TGCCAGCCCC CTACTCAACC 
6301 TGCCAATGGC ATCCTACTGC CAGCCCCCCA TTCAACCTGC CAACGGCATC 
6351 CTACTGCTCT TACACAGGAG CCGTGCCGCA GAGGGACTTA TTGGCCAAGG 
6401 TAAGGACTTT GGATACATTC TATCTCTCTA GTTTCACTCA GTTTCTTTGT 
6451 GGCTGCTGCA TACACACAAG CACATTCCTG TCTCAGTGAC TGTGCGCGCC 
6501 TTTCTCCCTC AGCCTGGAAA GCTCTTTTCA CTGAGGGGCA TGTGACTTGC 
6551 CCCTTCTTTT CACTGCTCTT ACACACCCAC ACAGCCAAGG CTTTCCTGAC 
6601 TTCTCTTTCT GGAGTGTCTT TTCTGTGATA TCTATTCCTC TACTCTGATT 
6651 TTTTGTTTTT CAAAAAAAAT TTTTATTGGA TATTTATTAG TTTATTTACA 
6701 GTTTATTTCC TTCAGTAGAT GATGAAATCT TTATAAGTGA AAGTCTTTTG 
6751 TCTGCTTTTT TTTTTTTTTT TTTCTGTATA CCCAGCATTT GAAACTGTGT 
6801 CTGGCACATA GTGGATGCTT AAAAATAAAT CTTTGGAACT CATTCCTTAG 
6851 TACAATGCTT GCTGGATAAG AAAATGTAAT GAAAAGAATT GATCGAGGAA 
6901 GACTGACTGC TTAGAAAGGC TTGTTTGCAG GGCTGGTGCT TGACTAGCAT 
6951 CTAGGAACTT GGATTTGGGG AGAGTCTCCA CCATTTCCAG AACTGATAAA 
7001 AGTGGCTTTT ATGCTTAAAC TAAACTGTTT GTGCAAACAA TATGGCTTAT 
7051 GCTGAATACT TGCTTTTCTT TTGGGAGTCT TGGAATTTTG GTATATGCTA 
7101 GTGGAGGGTT GCCCATGTGA CCTGCCCACA ACAAAAGCCC CTGAGCAGTA 
7151 AATCTCTGAA TGACTTCTCT GGTAGACATT TCACATGTGT TATCATAAGT 
7201 TGGTGCTGGA AGAATTAAGC ACGTCCTGTG TGATTCTGAT GCAGGACAGG 
7251 AGAGTCCCAA AGTTGGGGCT TAGCCTGGGA GGGTTCCTGG CTTTGCCCAG 
7301 GAAAGAATTC AAGGGCAAGC TGGTGGTGTT AGCAACTTCT ATTGAAGTGG 
7351 CAGTGTACAG CAGCAACAGA GGTGCCACTC CTTGCAGAGC AGGGCTACCC 
7401 CAGAGGCAGT GTGCCAAGAG TAGCAGCTCA GAGGCAGTTC TGCAGTCACA 
7451 TTTATACACA CTTTTACTTA TATGCAAATT AAGGAGTGGG TTATGCAGGA 
7501 ATTTCTAGAA AAAGAGTGGT AACCTCTGGG TCATCAGGTC ATTGCCATGG 
7551 TAAGAACTGG TAACTTCTGT GAGTTGCCAT GGCAATGGCA AACTGATATG 
7601 GCACACTGGT GGGCATGTCT TATGGAGAAG TATTTTCACC TTTTCCCTGT 
7651 TTTAGCTAGT TCTCAATCTG GTCCAGTGTC CAAGCCCCAC TTTTGGAATC 
7701 TGGTCCTGCC TCCTACTTCA ATTCCACAGA CAGAAGACTC TTGGAAGTTT 
7751 ATGCCTGGTT TCCTCCAGAC TTCATCTTAT GCATTTTTCC TTTTGCTGAT 
7801 TTTGCTCTGT GTCCTTTCAG TCTGTTTATT AGTCAGGCTA TGATCTCCTC 
7851 CATTTAATGG CTGTATTATT TTAGGGAAGT TGTTTAACCT CTCCATACCT 
7901 ATTTCTTCAT ATAGAATGGG AATAATAGCA GTGTCCACTT TATAGTGTTG 
7951 CTAAGAGGAT GGAGAGTTAA TTCCAAAGAC AGCTTTACAT GTGACCTATT 
8001 ATTTCAGGAG CACTAGGTGC TAGCACTGTG CTACATACTT AGCAGGTAAG 
8051 CTGCAAGGTC GGCTTTGCCA AGCTGCTGAG AACTACACTG TGAGTTCTAG 
8101 GTCTCCATGC ATCCTTCACA TGGGGGCTAC ATTTGCACCA GAGCCTGAAG 
8151 TGGCTGAGTT TCATGTGGGG ACTAAAGGCC CTACCTGGCT AGAACAGAAG 
8201 GAGCAGGTGT CAAGGTTACA ACTTGACTCT GTGGGGAACA TTCGAGATCA 
8251 ATGGAAGTCT GGGTCTTTTA GGTTTCTAAA TTATTCTGGG AATCAATGTA 
8301 GGGATTCATA CCTCCTCCAG CAGTTGCCAA TATGAATTTC TCCTTTCCTC 
8351 CTTTCTCTAA TGCTGAGCTC CGTGAATGAC CACTTATATT TAAATTCAAA 
8401 TGTTTATATA TGAAATCTTT ATACAGGCTG ATTTAATTTT TGTGTTGCTG 
8451 GAAGAGTTAG GGGTGCAGAA TCCTATATGG TGATGTAACT GAATTCTGCA 
8501 AGTGGACAGT GTATTAACAA GATGGGTTGG GGGCCAGGGA ATCTTGGTCT 
8551 CTCCAAAGTT GTTCTGTATG GTAGTCAGCA ATGGGTAGTT CTTTACTGCC 
8601 TGTTTATCTG CCCTGATGAC TGAAGAATCT GCATACTGAT ATGGCAGACC 
8651 CATTGGATTA AAGACTCCTG GACTGCAAAG TGCCTGCATA GAGGCCAGAC 
8701 ACTCTGGAAA GCCTCCTAGT CCCAAAACAG ACTTGTTTGA GTAAAAAGTG 
8751 CAATAAAGTC TTGATGTGTA ATTATGGAAT TGTTTGTTAT TCAGCATAGA 
8801 TTATCCTAGC ATGACTAACA CACTTGTTTT GCTCTATCTG TCCTGCTCCC 
8851 CGTCCTCTCA CCCCCATTTC ACCCTCCTAT ACTAGGCCAA CTATCCTACT 
8901 GTCTTTCAAT TGTCAAATTT TAGCAATTTT AACCAAGCTT TTTGTTAACC 
8951 CCTTTGTCAT AACGACTTAA GAAAGAAATT TTGGGCCAGG CGCGGTGCCT 
9001 CACGCCTGTA ATCCCAGCAC TTTGGGAGGC CGAGGCGGGC GGATCACGAG 
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9051 GTCAGGAGAT CGAGACCATC CTGGCTAACA CGATGAAACC CCGTCTCCAC 
9101 TAAAAATATA AAAAAAATTA GCCGGGCGCG GTGGGGGGGC GCCTGTACTC 
9151 CCAGCTACTG GGGAGGCTGA GGCAGAAGAA TGGCTGATCC CGGGAGGTGA 
9201 AGCTTGCAGT GAGCTGAGAT CGCACCACTG CACTCCAGCC TGGGTGACAG 
9251 AGCGAGACTC TGTTTAAAAA AAAAAAAGAA AGAAACATTG TTGTGAGAGA 
9301 ATGTGAGGAG GGCACCTTTT ATACCTTTTA TTTAATTAGC ATTAAAATTA 
9351 TTTTAAATCA CCAAGTGTAT TCTCTGGTCT GTTGAATGAC TATTTAATTC 
9401 ACCCTCAAAG CAGTTGTACT CCTTTGAGTC AGGCAAAGTC CCCAAGAACG 
9451 GCCTTCTGCC ACAGGAGACA GCTCCCTCTT TCTGCCATTT GTTCACATTA 
9501 CTGATTTATC TATCATCTGT TCACATTATC TGCCCATCAT CCATCCGAAG 
9551 CGGAGAAAAG TGCCATCACC AAATTTGGTT TCTCTATTGT AGTGTTTTTT 
9601 TGCTTAGGAA CATTATAATG TTCCTAAGAA ACATCATAAT GAAATGAAAT 
9651 GAAAGAAAGC TCTCATTTTT CAGGGAACTT TCTTTCAAAT GCTAACAACG 
9701 ATAGATTAGC TTTAATAGCC TTTTTCATTA ATTGTCTAAC CAGAGTAAAA 
9751 ATAAGTCTTG GGAGACTGAA TTTACTTTCA TATGCCTCTA TAATGTCAAG 
9801 GCACTATGGA GTATCTTTAG GACTGAAGCC AATCCCAGAA TCCAAATTGC 
9851 TTTTTCTGAG CTATTTGACT CTGTTCTTAT AACTATAATC TCAGTATGTC 
9901 AATTTTGACA TTTTTGTCGG TAGACGCTGC ACTTTAAAAA TATAAGGTAG 
9951 GGAATTATTA ATGGTATTAA GGGGAGATAC AGAAAAATCA CCCAATGTCA 
10001 TTTACATCTA TCCCATCAAA GGATGGATAA CAATCTGTCA TGGAAGGTTT 
10051 AAGTGCCCTC CTGCTAGGGG CGAAAGCAGA GACAGCAGGA ATGGTATTAA 
10101 ACTACATGAC CCCTTAAGGC CCTGTTCACT TTTAGGAGCT ATAAATGATG 
10151 CATTTCAGAG CTGGAAGTAT CATTTCAGAT AATCTCTTTA TATATGAGAA 
10201 GACCAGAGCC CATTTGGGGG TGATAACAAC TGTTATGGTG AGATGAGTTC 
10251 AGAATCCCTG CTTCATTCAA GATTTAGTAA CTTTGATCAC CTACCATGTG 
10301 CCACGCTGGG GCTACTGTTG TCAAAAACAT AGAGCACCTG ACCACATGGG 
10351 GCTCACGCCT AGTTTTTCAT TCATGGCTGT CTTTGTGCTG CAACAAATTG 
10401 GTTCAAATTT GGAAGTTGTT TTTCTTCTAG ACCATTCTTC ACTCTGAGAT 
10451 GCAAGAGTGC CCTAATGTCT TTTGGACCAT GATATTCTTC CCTTAAAAAA. 
10501 ATTTACTACT TTATTTTAAG TATTGCCCCT CCATTGGATC TTGTTCATCA 
10551 GTGCAGCAGC TCTGGGGTAG AGAATACATA AAATTCTTAA TCCCTTGTTG 
10601 ACAATTGATT TATTATGTCA TTTTTATAGA TGAGAAAAAT GAGGCTCTGA 
10651 GAGCCTAAAT TACATGCCCG TGACACAGAA ATCCTAAGTA TGTGAGATGA 
10701 GATTCTAATG CACGTCTGTA TTAAAGAATC CTGGACGGCA AAGTGGCTGC 
10751 ATGGAGGTCA GACACTCTGG AAGCCTCCGA GTCCCCAAAC AGACTTGTTT 
10801 GAATAAAAAG TACAATAAAG TCTTGATGAG TAATTATGGA ATTGTTTGTT 
10851 ACCCAGCATA GCTTATCCTA GCCTGACTAA TACAGCTGCT TCCAAAGCTT 
10901 TTAACCATCT TCTCTCCAAG CTACTTTAAT ATCCCAACTC TGTGTATTCT 
10951 TGGAGAATAT GTTCTAGAAA ATCATAATAA ATAGAGATTC TGTTTCTTCT 
11001 CAGATATTCC TTCTTTTCCA GCACACTTTC ATTGGTTCTG ATTAACTTTG 
11051 GGAAATAGCT TCAGTGAAAC ATTTAGGGCA TTAAGTCCTC TCAGAGTAGA 
11101 AAAGGATGAT TTCAAGGTGA AGGGGCAGGG AGTATTGGAA GCCTAGGGTT 
11151 CAAAATCTGT GACTGAGAGG GATGACCAAG AAGGCCAGAC TATTTTTTTT 
11201 AAGAGCTATA CTCAGTGAGT GAATTCTGAA TGGTACAAAT GAGGCTTGTT 
11251 GCTATGACAC CAAATGATTC CATATTTAGT GCAATGGAAT CCCCATTTGT 
11301 GTTCTTAAAA AATTAAAGCA GTTTTATGGA TATTAAATGT TTAAGTGGCT 
11351 CTAGCAGCTA GAAAGAAAAA TTGCCTTATG AATAAAACAG ACCAATGCGT 
11401 GATTGCTTGC ATGGAAAGCA TTTGAAAAAT GTGTTTTCAG AGTATGGAGC 
11451 TTGGCAGCTT TTGTTTTACA GTGTGCAATG ATAGGAGGAA ATGAATGAAT 
11501 GAATATATAA ATGATTGTAG GCAAACTATG GAATGAAAAA ATTGTGGGAG 
11551 AGAAATTAAG GTTACTTCAA TACACTTTTT TCCTGAGCAT TTATTATGAG 
11601 TTCAGCAACA TGCAGGGTGT TGGAGCATGC TAAAGCAGCC CCTTGGGTGG 
11651 AGTTTGACTT TACTCTTCCT CCATTCATGT ATTAGCTAAA ATGACACTAG 
11701 CATATTTGAC CATGTACTAA ATAAATTCCA AGATTCCAAT GTCTTAATAC 
11751 AGAAGATGTT TTAATAATCA AGTTTACTGA GGAATAATTT ATATAAAGTG 
11801 TACAGTTCAA TGAACATTAA CAAAGATTTA CAACTAAGTA TACAGCACCA 
11851 CAGCCAAGAC ACAGAATATT TCCATCACCC CAAAAAGTTC CCTTATGCTC 
11901 TTTTGGAGTA GTTCCCTCCT TCCACCCCCA GAACTGAGCA ACTACTGCTC 
11951 TGATTTCTGT TCCTATGATT TTGACTTCTC CAGAATGTTA CATTAATGGA 
12001 ATCATTCACT ATATAGCTTT TGTGTGTGTG CCTGTCTTTT TTACATAGCA 
12051 TAATGCTTTT TGAGATTCAT TCATGTTCAT ATATATATAT ATATATATAT 
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12101 CTTTAACACA TTTTTTATTG CTGAATGGTA TTCCATTGTA TGCATTTGCT 
12151 GCAATATGTT TATTCATTTA CCAATTAACA ACCATTTGGG TTGTTTCTAG 
12201 TTTTTACTTA TTGTGAATAA AACTTCTTAA TCTTGTACAG ATCTTAGTGT 
12251 GAATCACAGA GTAAATTTTT CTCACTTATG CAAATGTTCA ATATGGGTTT 
12301 TCTCAATGGA CAAAAACAGC TTTTTGTTTT TTTCCCACAC AGTAACTTGG 
12351 AAATCTAGGC TTCTTTTGTC TTGTGGCTCC CTAACCATCC AGGGCCTTGG 
12401 ACTTGTTTGT TTCTAGGCAT TGGAAAGGAA AAGGATCTAC TTCAAGTACT 
12451 CCATCTCAGA AAGGAACCAA ATCACTTCTA CTTATATTCT GCTGGTAAGA 
12501 ACTAGATATG TGGCCACCCC AGGATGCAAG GGGGCTAGGA AATTTATTCC 
12551 ATAAACAACC CTATGCTATG GACAGGGGTG CATTGATTTT GGGTGGACCA 
12601 ATATTTTCTG ATAACACAAT TTATTTGCTC ACCAAATATT AATTGAGAAT 
12651 CTACCAAATA TCAAGTACTG GGAGATAAAG CAAGGACCAA ACAAATAAAC 
12701 CAAAACAAAT CCAGGCCTTA ACATCTTGTG TTTATAGTCT GGATAGATTT 
12751 TAGGGAAQTA TAATGTGTAT CTTCATCCAG AAAACAGTCA CTAGGAATAA 
12801 AAGTGCCATA TCAGAAATAA TGGAAAATGG AAAAGATATG AAAATTATAA 
12851 AATAATAGAT GAAGGGAAAA TCACTGGAAA TGGGGGAGGG AAATCTCCCT 
12901 GGAGGGGATA AATTTTCATT TGGGTCTTAA AGATGCAAGA AATGTGGATA 
12951 GGTAGATGGC GGTGGAGTGG GAATTTCAAA AGAAATAAAT AATCCAAAAA 
13001 AGGGAGTGGA AGAAGCCATA TGCCTTGGGT ATGCCTTGCT TGGGGGTGGC 
13051 AAAAGAAATT TATCTGACTT CCCTGCTTTG TTAAGCTGCC ACATGCTGAC 
13101 CTTCCAGCCA TAGTGCATTT AAGTTGCTTT CTAAGGAAGT TCTATTCAAT 
13151 TTGACTGCTT AATGGCTGCT GCCGTTTTAT GGATGGTGAT TGAATGTTTC 
13201 AAAACGTGGA ATTGCATACC CCATGGTGAA ACTTCTTCAT AAGATGAATA 
13251 AGTAGTGCCA TCACGTTGAA GGAGTTTTCT AAATATTAAT AGAATTAAAT 
13301 AAATTAAGAC AACAAGAAGA AAGCTACTCC TTTTAAAGAA CGGTTTGGAG 
13351 CTAGATCATC CCAATTACAA ATTAAATCCT TGACTGTTTA AATCTACAGC 
13401 TCTATTAATA CTCTAATATG CCATATTATT AGTTCACAGA TGCTGGCCCC 
13451 ATGGCACAAG CAGGTAACAA AAAAAGAGGA CTGTGAGTTC TGCCACAACT 
13501 GGCTTGTTTG CTGCCACAGC CTCTGAAGAA TGCCTTGACC TCCACTTACC 
13551 TCTCAAATCT CAGATGAACT GATTTCACTT ACACAGCCTC AATTTCATTC 
13601 ATATTCCTAG ATCCAAGAGA GTCTAGGAAA TGTAGATTTT AGTCTTTTAG 
13651 TGTTTCTGAC TTAGGACATG GAAGGGCTAA GTGGCTATCA TTGCTAAACC 
13701 CCAATTGGCA ACAACTAGTC CAATTAAATT TCTTTTTTGG CTCCCTCACT 
13751 TTTTTTTTTT GTCTTTCTGG CAGTTGAAGA GAAAGAGTTT TTTTGTTTGT 
13801 TTGTTTGTTT GAGACAGAGT CTTGCTCTGT CGTCCAGGCT GGAGTGCAGT 
13851 GGTGTGATCT CGGCTCACTG CAACTTCCAC CTCCCAGGTT CAAGCAATTC 
13901 TCTGCCTCAG CCTCCCAAGT AGCTGGGATT ACAGGTGTCC GCCACCATGC 
13951 CTGGCTAATT TTTGTATTTT TAGTAGAGAC GGGGTTTCAC CATCTTGGCC 
14001 AGGCTTGTCT TGAACTCCTG ACCTCATGAT CCACCCATCT CGGCCTCCCA 
14051 AAGTGCTGGG ATTATAGACA CGAGCCACCG TGCCCAGCCG AGAAAGAGTT 
14101 TCATGTAAAC ATGCCATAGA AAGAGAGACA TTGAGACATT ATTATTCTTT 
14151 TAAATAGGAG AGTTCTTGTA TTTTTATTTG ATTACATTCT TATAAAAACC 
14201 AGTATTAAAA GAAAACCAAT TTTAAAAGAA CACCAATTAA AACCAATTAC 
14251 AATTCGGATT GTCCATGGAA ATTGATATCA CATAGTACTA GGACTTCTGC 
14301 ATCATGGACA CACTAGACTT TTTATGCAGT GGCTGTGTTC CTCATACATC 
14351 TTTGCTGCAT CTCCGCCCAA ACAATTTTAT TTTACTTTTT AAAAAAGATG 
14401 ACTAAACATT TTATTTATTA AGTTTATTTT TTTAAATCTT TAAAATTGTT 
14451 TTATTTTCAT AGATTTCGGT GGTACAAGGG CAGGTTTCTT ACATGCCTAT 
14501 TGAGAGGTGA ACCCAGCTGG GCTTCTGGGT TGGGTGGGGA CTTGGAGAAC 
14551 TTTTCTGTCT AGCTAAAGGA TTGTAAATGC ACCAATCAGC ACTCTGTGTC 
14601 TAGCTAAAGG TTTGTAAATG CACCAGTCAG CTCTCTGTGT TTAGCTAATT 
14651 GGGTAGGGGA CTTGGAGAAC TTTTCTCTCT AGTTAAAGGA TTGTAAATGC 
14701 ACCAATCAGC AGTCTGTATC TAGCTAAAGG TTTGTAAATG CACCAGTCAG 
14751 CTCTCTGTGT CTAGCTAATT GGGTAGGGGA CTTGGAGAAC TTTTCTCTCT 
14801 AGTTAAAGGA TTGTAAATGC ACCAATCAGC AGTCTGTATC TAGCTAAAGG 
14851 TTTGTAAATG CACCAGTCAG CTCTCTGTGT CTAGCTAATT GGGTAGGGGA 
14901 TTTGGAGAAC TTTTCTCTCT AGCTAAAGGA TTGTAAATGC ACCAATCAGC 
14951 ACTCTGTGTC TAGCTAAAGG TTCGTAAATG CACCAATCAG CACTCTGTCA 
15001 AAACAGACCA ATCAGCTCTC TGTAAAATGG ACCAATCAGC AAGGTGTGGG 
15051 TGGGGCCCAA TAAGGGAATA AAAGCAGGCA ACCCGAACAA CCAGTGGCAA 
15101 CTCGCTAGGG TCATCTCCCA CGCTGTGGAA GCTTTGTTCT TTTGTTCTTT 
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15151 GCAACAAATC TTGCAGCTGC CCACTCTTTG GGTCCACACT ACCTTTAAGA 
15201 GCTGTAACAA TGAGCTGTAA CATTCACTGT GAAGGTCTGC AGCTTCACTC 
15251 CTGAAGCCCC TGAGACCAAG GAATGAACAA CTCTGGACAC ACCGCCTTTA 
15301 TGAGCTGTAG CAATGAGCTG CAACACTCAC CACAAGGGTC TGTGCCTTCA 
15351 TTCTTAAAGT CAGCAAGACC AAGAACCCAT GGAAAGGAAC CAATTCCGGA 
15401 CACAGTATGT TGCATGGTGA TGGAATCTGG GCTTTTAGGG AACCCATCAC 
15451 CCGAATAGTG ACATTGTGCT CAAGAGAGAA TTTTTCAAGC CCTCACTCCC 
15501 CTCCCACCCT TCCAACTTCT GTAGTCTCCA ATGTCTACCA CTTTATTCTA 
15551 TATGTCTACG TGTACTCATT GCCCCAAACA ATTTTAAGTC AACAGTTCAT 
15601 GAAGCGTAGG TATTTTATGA AAGATTCAGT TATACGATGG GGAAAAGGCT 
15651 TCATATCATA TCCTACCTCA TAAAATTGCT TGTTCCCATC AAAGTTCCTC 
15701 TTCCCAGCCT ATTTCATTCC TTCCTAGGGG CTCAAGGAAA TTATCTTTAT 
15751 TGTGTCAATA AATTCAGCTG CCACTCTCAG CTTGCTTATG TATCACTCAG 
15801 CAAAAACAAA AACTAGGTAG TTAGCGGAGC CATCTCTATC AATGACTCCT 
15851 CAAGAATGGC CTCTTGCCAG AAGGTGGGCA TATAATGGAA TGAGCTTTCT 
15901 TTTTAACTTG CACCATAAGT GGCTTTTCAA AGTATTCACC AACAAAAGAG 
15951 TATATTCTTA CAGTAAAATT CTATCTTAAG AAAATACATT TTGCATTAGG 
16001 GAAATGCTTC AGGATATTGG TCTTGGCAAA GATATTTTGG GTAAAACCTC 
16051 AAAAGCACAG GCAACAAACA CAAAAATAGA CAAGTGGGAT TACATCAAGA 
16101 TGAAAAGCTT CTGCACAGCA AAGGAAACAG CAGAGTAAAG AGATAACCTA 
16151 CAGAGTGAGA GAAACTATTT GCAAACTATC CCTGTGAAAA AGAATTAATA 
16201 ATCAGAGTAT AAAGGAACTC AAAGAACCCA ATAGCAAAAT AAAAAGATGA 
16251 TAAAAAATGG GCAAAAGATC TGAATAGACA TTTCTCAAAA GAAGATACAA 
16301 AATTGATCAA TGGTATATGA AAAAATTCTC AATATTACTA ATCACCAGGA 
16351 AAATGCAAAA CAAACATAAT GAGATAGCAT TTCATCCCAG TTAAAATGGT 
16401 TTTTATCAAA AAAATTAAAA AATTACAAAT GCTGTCCAGG ATGCAGAGAA 
16451 GGGGAACCCT TGTATGATGT TGGCAGGAAT GTAAATTAGT ACAGCCATTA 
16501 TGGAAAACAG GATAGAGGTT TTTCAAAAAA CTGAAAATGG AAATAGCATA 
16551 TGATTTAGCA ATCCCACTGC TGGGTATATA TCCCTAAAAA AGTAAATCCA 
16601 TTATTATGGA GATATCCACA CTCCCATGTT TATTATAGCA CTATTCACAA 
16651 TAGCCGAAAT ACAGAATCAA CCTAAGTGCC CATTAGTGGA TGAATGCATA 
16701 AAGAAAATGT GGTATACATA CACAAGAAAA TAGTATTCAG CCATAAAAAA 
. 16751 TAATGAAACC CTGTCATTTA CAACCACATG GGTGGAACTG GAGGTCATTA 
16801 TGTTATGTTA AATAAACCAG GCATAAAAAA CCAAATATTG CATGTTCTCA 
16851 CTCACAAATA GGGGCTAAAA ACGTGGATCT CATGGAGATA GAGAGTAGAT 
16901 TGCTGGTTAC TAGAGGCTAG GAAAAGTGGG AAGGTGGGTG GGGTGAAGAG 
16951 AGGTTGAATA TGGGGTACAA ACATACAGTT AAAAGGAATA AAATCTGGTC 
17 001 TTTGATAGTA TAGTAGACTG ACTATAGTTA GCAATAATTT ATGGTGTATT 
17051 TCAAAATAGC TAGAAGAAAA GAATGAGAAT GTTCCCAACA TAAAGAGAAG 
17101 ATAAATATTT GAGGTGATGG ATATTCCAAT TACCCTGATT AGATCATTAT 
17151 ACATTGTATG CATGTATACA TATCACATGT ACCCCCAAAA TATGTACAAA 
17201 TATTATGTAT CAGTAAAAAG CTGTATGAAT ATCCATGCAC CTACTGTTTG 
17251 TTCAGCAATT AATAAATGTC TGTTAACATA ACAAAAGTTT TCTTTTAAAA 
17301 AATGCAGTTA TTATAAGQAT TTATTTTGTA GAACAAGAGC CCAACAGTCA 
17351 ACATTTTAGA GAACTCTTAC AAATTATAAT TAATATTATG CAAGAAACAG 
17401 AAATGGGATA GGATAAAGCT TTATAAAGTA CACATTAACT AAACAGGAAA 
17451 ATATAAAAGT AATTAAAATA TGATTTAACT TAGATACTAT GACATGTTTA 
17501 TATTTGATGC TTTTCAGGTT TAAAGCACCT CCTGTCCCTA ATTTTATCTG 
17551 AACTCTACAA TGATGTCACA AATACACAGG TCCAATATTT TTCTAAAGCT 
17601 CTGTGCTTTG' AAGTGGTGAC TTCTAGGCTA TAAAAATTGT GGCCAACTTA 
17651 TTTCCCTTGT TGTTACTACG GAGTATTACT AGGCACCTCC TCTATATATT 
17701 TCTTCTTTAT ATATCATAGC TGATAGTTTC AACCCCATCC AATTCCATTA 
17751 AAATAATAAA AACATACAAC TTTGAGTGCT GGGTTTGTGC TATTTGCTTT 
17801 ATAGACATTA CTTCAAATTC TCATACTGTT CCAAGATAGT ATATGTACAG 
17851 TTGTATAGAA CATTGGTCCC CAACATTTTG GGCACTAGGA ACCAGTTTCA 
17901 TGGAAGACAA TTTTTCCACG GACGGGGGGC AGGGGGAGAT GGTTTCAGGA 
17951 TGAAACTGTT TCACCTCAGA TCTTCAGGCA TTAGATTATC ATAAGGAGCA 
18001 CGCAACCTAG ATCCCTTGCA TGCACAGTTC ACAATAGGAT TCATGCTCCT 
18051 ATGAGAATCT AATGCTGCTG CTGATCTGAC AGGAGGTGGA GCTCAATGGT 
18101 AATGTGAGTG ATGGTGAGTC GCTCTCTTGC TAGCTGCTCA CATCCTGCTA 
18151 TGCAGCCAGG TTCCTAACAG GCCACAGACT TGGACCAGTC TGCAGCCTAG 
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18201 GGGTTGGGGA CTCCTAGTGT 
18251 TATTAACCCA AACTAGAACA 
18301 TAGGGTTTGG GCTCCTTGCT 
18351 CAGAACCGGA ATGGTTTACT 
18401 TGATAATGGC TACTTAATTT 
18451 GCTCTTTGTT AAATTATGAA 
18501 CACCAAGTGC AGGGTCCTTC 
18551 CCAAGAAGCT CGCCCTGGGA 
18601 CACTGTGGAT ACACATCAAA 
18651 TCCAGTGGTG GAGGCTGTGG 
18701 GTCAGCATCA ACTTCTTTTG 
18751 CTGAAGGATA GGATTTCTTG 
18801 TGTCAATAAC CTGAGAGGAC 
18851 TTCAAACCAA AGTGTGACCA 
18901 CTTTTGACCT AATGCTGGAT 
18951 TGGCGACCAT GAAGCATTCT 
19001 TGATTTGGTC TCTGCTAGTC 
19051 TCAATACCTT AAAGTCATGC 
19101 ATTTTCCTCA AAAATAAAAA 
19151 CAAGCTGCAC TAGAGATTTG 
19201 AAAAGGTGAA TGCATGAATC 
19251 AGCATCCTTC CAAGTCATCT 
19301 TCAAGGTCAT GAGGATCAAA 
19351 AGTGGAGAAT GAGAGAAATA 
19401 GAGAACGAAA CTCATCTCTC 
19451 TTGTTTCTCA AGGTCAAGCT 
19501 GGCTGGGACT CTTAGTTATA 
19551 AGCAGGGCCC TGTGTAATTT 
19601 TTGGCTTCCC CACTTTCTTC 
19651 TATAGATTGG GTCTGCTATG 
19701 AGAATGGACT CAGTTATTCG 
19751 AACAGTTTCT CTAAATAGGT 
19801 CTGTAACACA GGGTCTATCT 
19851 ATCCAGGGAA TCCCAAAACA 
19901 CTCTCTCTCT CTCTCTCAAC 
19951 GATCAAGACT GTTCCTTTGT 
20001 TGTGTGTGCA CCTGTTTGCA 
20051 ATAGTCTAGG GGAATGTACA 
20101 CTTCAACATC TTGTTATTTA 
20151 TTATTAAGTG TTCAGAATTG 
20201 GGACTCAACC TTAAAAGTAC 
20251 TAAAAATCTG GTCTGCAACA 
20301 CAAAAACGCT TCTATTTGTA 
20351 TTTAGATATA TATTTTAAAA 
20401 AAATTTCTCA TCTTTGTTAT 
20451 GTTAGCTTTG AGGATTTTAT 
20501 TCCTCTTTTT TCTAAATAAC 
20551 TTTCTCTTTG ATTTCCCTTA 
20601 AAATCCCCGT AAATGTAAAA 
20651 AACATGACAG CCTTCTTATC 
20701 GCCAAGTCTC AATTGAAATG 
20751 CATTCAAGTT ACATTGTATA 
20801 TCATTTGAAC TTGCACATTT 
20851 AGGATTTJAC ATCAATCATT 
20901 TAAGGCTGGG AGCTAGGTCA 
20951 CGGCTGCTTT CAGGAAACGT 
21001 TTGAGATGCT GCCCCACAGA 
21051 TGGAGATTAG AGGTGTCAAT 
21101 TTGACTCCCC CAAAACTTAT 
21151 GATTGGTAAC ATAAATAGTT 
21201 TCTGTTTTCT TATAATAAAG 
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AGAAGATTAT TTAACTTTAA TATTAAAGAT 
TTGGTAAATG CTGGGAATAC CATTCAAACT 
TTCTCTATCT TGTTTCCTAT GAGTAGAATC 
TCAGTTTTAA AAATATTAAT AGTTATTGAA 
GTGAGTCCCA GGTCAAAATG AAAATGCAAG 
GAATTCCAAG ATAATGACAG CAGAGCAAAA 
CAGTCCCTGA GTGGCTGCAC AGCTCACTGC 
TTTGGCTTAT GCGACAGCTG CTGTACCAGG 
ATAAGGTATC TTCCTCCTGC TTGGGGAGCT 
AGAAATCTCT CTTTTGGAAT TGTCTGTAGA 
AATATCTTCA GAGTGGCAAA TCTTTTCTCT 
AAATAGCCAA ACTCACTCTG TTACAAGCCT 
AGCCTGGGTA ATTAGTAATG GAGCCTTTGG 
TGAAGAAGGA AACTTGATTA TCTTGTGAGG 
ATAGGTACAA TAAAAAGGCT CCAAAAATAT 
CAGATGGGCT GACTACATTG AAAGCAGTGC 
AGCAATAGAA ATTAAAGAGT ATAGTCAGCT 
TTTCTATGCT TGGTAGATTG CTACTTTGGT 
ATTATCCTTA GCACTACAAA CTTTGATAAC 
TAATAGTAGA AATAAACTTG AATTCAGGAA 
TTGTGATATG ATGAAATGAT AGAGTCGGAT 
GGGCAAGGCT TTCAAGACCA TCTCTCCCTA 
TGAGAACGTG TGTTTTAATG TACCTTGTAA 
TAAGAAATTA CATAGCCTCT TCATTTTACA 
TGGGCTTGGT CATCTGTCTT CTGACTCTCT 
TTTATTGGTT GGGTCTCGTT CAACAACCGT 
ATGCAGCTGC TATCTTCTCA CCACTACAGT 
CTGGGTGATG AATTAGACCT GCCTAGTCCC 
CCTTGAGTTA GAGGCTAGAT GACTTACATG 
AACAAGGTAT GTGCTGGGAA AATGAAATTA 
AAGGTTTATT TTCCCAACCA GGCAAGGGAG 
AGGCTTGGGT AGCTGTCAAC TCTAATCACT 
ATAAATGACT AGAACCTGGA ATCTCATGTC 
GAAACCATCA CTCCAATTAC GAGGCGTTTT 
ACTTTGGCTA AATTAGGCTG TGGACTGAAT 
TCCTTTGTTT ACTTCCTGTA TGTTCAATGG 
TGTGCATGAG TGTGTGTGGG TTTGATTTTC 
ATTGCTTAAT CAATTGAAAT AAAACTACCT 
TTTCATTATT AACTTTGGGT TTATTTATTA 
GAATTTCTGA GGTGAAAATG CAATGTCCAT 
CACATGTGCT AGAATGCACA TATATATGGT 
AGTCTTATCA TCAGTTTACA AGCTTCTCCC 
AAATTTCTCT TCTATAGCTT GAACCAATGC 
ACAGGATTTA GCTCAGTACC CTTCCTTTTA 
GAAGGATTCA AATAACACCA ACTTGGAAAT 
TTTTAATAGA AAACAGCTAG CTGAATGATG 
CTGATTTCTC AGGCTATCTT GAGAGTTTTA 
CCACTTCCTA TCTTATCTCT CCGGATAATC 
CCCTTCTCCT TTGAAGCTGC TGCCATTTAG 
TCTCTTCCCT GAGCTGCTTC TTCAGCTCAT 
TATCTTTTCT TCTTCTCTGT TATGATCCAA 
ATATTTAGGG ATATGAAATG TGTGTATATT 
TGCTACACTG AGTTAATAGC ATCACTGATA 
CAGAAATATT TACTTAGTTG CTGATACGTA 
CATGCAAGGA TACATGTAAT GCATGGTCTT 
CTGCATTCTA GCACAAGTAA ACTTATGGCT 
AAGGAGACTA CAGTGGATCC TTGAACAATA 
CCCTACACCG TCAAAAATTT GTATAGAACT 
CTATTAATAG CCTACTCTTA ACTTGAAGCC 
GATTAATGTA TTTCATGTTA CATGACTTAT 
TAAGCTAGAG AAAAGAAAAT GTTATTAAAA 
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21251 TCACAAAGAA GAGAAAACAT ATTTACTATT TAGTAAGTGG AAATGGATCT 
21301 TATAAAAGTC TTTATCCTTG TTGTCTTCAC ATTGAGTAGG CTGAGGAGGA 
21351 GGAAGAGAAG GGGTTGGTGT TGCTGTCTCA GGGGTGGCAA ATGAGGGAGA 
21401 GAATTCACAT GCAAGTGGAT CTGCCTATTT AAAACCAGTG TTGTTCAAAG 
21451 GTCAACTGTA TTTGCTACGT TGCATGACTT GTTGTGCAGA CATTGACCTG 
21501 AACTTTGCTT CTGTTATAGC ATCTTCACCC ACTGTGACAT TTGCTGGTAG 
21551 TTGGTTCCTC TCCGTGTACT GCCCACTTCC ATTTATCTTT GGATGTTCAG 
21601 CAGAGTGCAT TGCACATGCA ATAACTTTGT GAATGACTGA TGGAGGAAAT 
21651 GCATGTCTAA GTATTCTTCA AGGGCCGCTT TCCCCTAGTT CTGATCTAAT 
21701 ACTGTTTGTG TAGAGTCCTG TTAAATTCTA GTGCAGCATG ATCTCTGTCA 
21751 GATATTAAAT TCATGGTTTT GTCTCCATTC CAGAAAGAGG TGTCAAGGGT 
21801 TTTTCTGGTC TGCCTTTACG TAGAATTACT TTATCTCACC TATCCTTCCT 
21851 TCTTCCCCAC CCCAAGGTAA ACAAGCTTAT GTTTCACAGT TTATTCTCCC 
21901 TTGGAGAAAT CAAGCTCTAC TGTTCATAGA CCACGATGAC CTGAACTGTT 
21951 TTGGATGTCA TGACCCACGG GGAGCAGTGT GTGGATCTTG AGAAGGATGG 
22001 AGGAAAGAAA GGTGATTGGA GCTGCACTCC TCACTGGGGC AAGACTGCTG 
22051 AGGCCATCTG GTTTTGAACC TGGTTTGTAA CCTAGGGCGT CTGGAGGGGG 
22101 ACAGTGCTGC GGTAGCAGCA CAGCCCAATG CCCACCCTCC AGGTGATATG 
22151 GGGCCTCATC CCATACTGTC TTCTAGGAAC GGCTCCCTTG GCCTGTAGTT 
22201 TGAGCCTCAG TTGTGTCTGT GTTTTGAAGA AGGTGACTGC TGATCTGGCC 
22251 ACTTCCTCCA TGCTGAGCTG TGGTGCTGAC CCCTGTGAAT TGCTTCTGTT 
22301 TACTAAGTTA TAGCGTCTGC TTTATCAGTA TGCTTTTCAT TTATTTGAAG 
22351 CTACTTCTCA CCAATTTCAT GTTCCATATG AGACCTTGGG GCAGGGATTT 
22401 AAACCTCCTA TTCTATGGGG TGAGTGAGAC ATAAAGGTAA ATTTTGAGTT 
22451 GGAGAAACAT GTGGTAGATA GTAATTCCTA TTAAAATAGG ACCATCTCAA 
22501 CTCAACAGAA CAAAAATACA GACTTATTTC CTGCAGCAAA ATAAACCCAC 
22551 ATGTGAAGAT GTAATCTGCT CAGGCCCCCA GATGCTCCTA ACAAAGGGAG 
22601 CATTAGCCTT TAGTGTCCAG ATTGCCAGTG TCCATTCTGT AAATCTTTTC 
22651 TCCTCTATAT TTACTCTTCA GCACCTCCAT ATTTTCCTTA TTTCTTATTC 
22701 CTTTTATTGT CACATTTGCT CACAAGAAAG GGTGTAATGT TTGTTGCTGT 
22751 CTGTTGATGG AGGAAACTTC AAACTGAAGC ATCTCTTCTA GGGGTGCCTG 
22801 TCTTTTAATA AGCACAGGAT TGGTAAACCA AAAAACTAGA ACTCAACAAA 
22851 TGAATTAAAG TGTTGACCAG AAAGGCCTAA GACATTAGAG TCAACAGACA 
22901 TTCTTGGTTC TTCTGAAATC TGAAGTGTGA GGACTTATCC GCCCCACCTT 
22951 AATGGTTTGG AACATCACTG GTGACAGCCA TCTTGTTTAA AGCTTTTATT 
23001 TGTATCAAAG TATATTTCAC AGGAGTTTAT GGTAAGTATC ACCCATGATT 
23 051 CCAACCTCCA AACTAAAGAA TGGGACTTGA AGTGAATCTT TCTGGAAATG 
23101 GCAGTTATAA ATCTCCCCCC TCCTCTTTGT CAGCATGCCC AGTGGTGGGA 
23151 ACTGAAGCCT GAGAGGTTCC CAACATTTCA ACCTGATGTT TATGTTTGAA 
23201 CCCATCTTGA CGTCATGAAA ATCCACACTT CTTTTGGTTA GGAGGGACTG 
23251 AGCAGCCTTG CTGCTGGCTT ATACTTGAGT CTTCTCATAT TAGGGACATT 
23301 CAGTGGTACA AGGAGAGGAA AGCAGGGAAA ATTGGGAGGA GAAATCCCTA 
23351 ACCTAATGTC TTCATCAACC CCTTTTGCTC ACTGGGACCA GCCACCATAA 
23401 AAAGTCTGTT CCTTTCTGCT TTGTGGAATG TCTCCAGGAT GGTCAAGGGG 
23451 AACCTGTTTA TTCCATCCAT TTTGTCTGTG TTCACAGACT TGGCTGAGGA 
23501 TTTAGTTCTT GCTGTTCATA GCCGTCCCAG TTGGACCTGA AGCTTCCAAG 
23551 AGTGCCTTTA TGAATGAAGA ATTCAATTAC CCACATACCC CAGCCCCAGC 
.23601 AACCAGACCC TCAGCCTGGA TAGAAACCTG GATCGCCACT TAATGCAAAC 
23 651 AGAGAGGTCA GGAAGCAGAG TAGGGTAGTT GTATTAGTTT TTTTTTTTCT 
23701 TCATAGCATA CAAATTTAAC AGCTTAAAAC AAAAGAAACC TAATATCTCA 
23751 TAGTCTCCAT ATGTAATATT CTGGGCTTGG GTTAACTTGA TCCTTTGCTC 
23 801 AGGGTCTCAA TTGACTTAAA TCAAGGTGTT TTCTGGGGCT GCAATATTAT 
23851 TTTAGTGTCT CATCTGAGGG CCCTCTTCCA GGCTCACTAG TAGTCCAAAG 
23901 AATTCAATTC CTTGTGGCTG TAGGAGTCAA GGCTGCTTGC TCCTTCAAGA 
23951 CCAGTAGCTG AATCTCTCTG ACTTCCGTGA AGAACTCAGC CTTTCTTTCA 
24001 AAGGGCTCAC TTAATTGGGT TAGGTCCACC CATGATAATC TCCCTTGTGG 
24051 TTAACTTTGC AAAATTTCTT TTGTCTTATA ATGTAACATA ATCACAGAGT 
24101 GACGTTCCAT CTTATTAACA AATCCTGCTC ATACTCATGG AAAGGGCTTA 
24151 TAGAAGATGT GCACATCAGG GAATGATGGC AATCTTAGGG GCCATCTTAG 
24201 AATTCTGCCT ACCAAAATAA CGAAAATAGC ATGATCCTGG GAGACAGACC 
24251 GTGdTGGGTT TCATTCCTGT TCTTGGCTTT TACTGTCTTA TCAGGTAATA 
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TTGGGATCTG 

CAGTCTTACT 

GCCAGCCACA 

AAGATGCCAG 

GTAATGCTGA 

ATTTTGAAAT 

TGTCAAGTGA 

GTTCAGCCTC 

CAGACCATGC 

ACTTTTATAA 

ATTATAGAAT 

AACATTTGGT 

ATATTATTGG 

CTTAACATTA 

TTAAAACCAT 

GTTAGCCTGT 

TAAATTCCAT 

CCCAGCAGCT 

TTGAAAGGCT 

AATTTACTTT 

AAACTGCAGT 

GCTTCTCTCC 

GCAATTTATG 

GGTCTGTTCT 

ATAGGTAAGT 

ATTCCTTTGG 

TAATTGAATT 

CCCTGCTTTC 

AAAGATGCTT 

CCCAAGATTT 

TACTCCCATG 

ATGATTATCT 

GAGAGTTTTT 

CAAGGGGAAT 

GACTGTGTGA 

GGCTGACAAC 

ACTGAATTCT 

AATGTCTAGA 

GATAGCCTGA 

TGTAAGCTGG 

TGCTCCACAG 

CCAGTGAAGC 

TATCAAAATT 

TACAATATAA 

TTGAGGTATC 

CCTGCAATGC 

ACCAGGCCTT 

CAACCCACCG 

AATGCTGGAA 

CCCAAAAAAG 

AAGAACAGGA 

GCATTGCTCA 

GCTTAAAGAC 

GGATATGAAC 

TGTGGCTGAC 

TGGCAGAAAA 

AGCTATGGAG 

GTCTCCCAGC 

AACAGAGCCC 

ACAGAGGGAT 

TTGATAGGCA 



ATACTTCATC 

ATACAGATTT 

CGGTGAGCCC 

ACTTGGCTAA 

TATGTTATGG 

ATTGAGGAAC 

TAACTGGGTA 

AACTACAAAA 

ATCTGATTAT 

ATAAGTTTGA 

TCAGGGACCA 

TTATTTCTTT 

TACCTTTCTA 

GATCATAGAC 

CATTTTAAAT 

GTGATTATTT 

AAGGAAAAGA 

AATAGTGCCT 

GAAAGAATAA 

CTGCTCTCTT 

GGGCATTGCA 

TCAGTTTAGA 

AGTATCCACA 

TGTACAAGTG 

GTGTGTACTT 

GTTATGGATT 

TGACAAGTAA 

CTTCTGACCT 

TTATAATGAG 

TATACTTTAA 

ATTATGTCAG 

GGGTGGGCCA 

CTCCAGCTAA 

TTGATGTGCT 

GAAGGAATGC 

CAATGAGGAA 

GCCAACACGC 

TGAGGGTCCA 

GCAAGAGAAT 

TGAATGGATG 

CAATAGAAAA 

AGGCTCTCAG 

ATCCAGGAAT 

CTTACCATTT 

TCCAATTACT 

CCAGAGAGGA 

GGAATTACTT 

GCAGATAACA 

AGCACCCAAA 

AAAAACAAAC 

AGAGACAAGA 

CGGCTACGGC 

TAAAGGAATG 

CACAAAAAAA 

AACTAACTAA 

GGAGTCAGAG 

AGCCAGATCT 

TGCTCACAGC 

AAGGCTAGAA 

GAACTCTATT 

CAGGAGAGAA 
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TGTCAAATGA 
AAGGTAATGT 
TCCGTAGTAG 
GGAGGGAGAG 
GCCTAATTAT 
ATCTGCAAGT 
GTTTCACATT 
TTCAAACAAA 
AGGGGTAGGT 
TTTTTCTATT 
CATGTAAATA 
CCAGTGTTTT 
TACCTATAAT 
AGATTTCTCT 
GTCTGCACCC 
GATTAATGTC 
ACCAGGTCTC 
GGCACACGAT 
ATAAATGGCT 
GTTTTTAACA 
CATAAATCAT 
GTGCAAATTG 
CATGAATACC 
ACCAACCTCT 
TTTTCGATGA 
ATAACAGAGA 
GACCCGAAAT 
CTGTCCTTGG 
ACTACAGTAG 
TATCCAAAAC 
GTTAGATATC 
GATCTAACCA 
TGGCAGAAGA 
GCTGCAGGCT 
AGTGGCATGC 
ATGGAGATCT 
TGAATGAGCT 
GCCTGGCCAG 
ATAGTCATGC 
TTGTTTTAAG 
CTATCACAAG" 
TTGTGTAAAC 
CTGGCTTCTA 
TACAGGAAAA 
AACTATGAAG 
CGCATTCAAA 
CTTTAAAAAT 
ATTATAAAAA 
AGCAGAGAGA 
AAACAAACAA 
ATTTTGAGTT 
TCTTTCCTCT 
AAGCTGAAAA 
GAAGAAATTA 
ACATGCCCTG 
ACCAGAGATA 
GTGAATTTCA 
TTGGGAGGCA 
ATTTGGGCAA 
TCTACCCAAA 
CTAGGAGGTT 
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GGCTGATAAT 
GTATAAAGAT 
TGATACTATG 
GTGGGGTATG 
TGCTGAAAAA 
CCATGTAAAG 
TATTTTTATG 
GAAGATGTAA 
AGAATGAAAG 
TATAAAAGTA 
TAAAGAAGAC 
CTTCTTGCAC 
TACAGATCAT 
GTTTTTGGAC 
TTCAGAGGTT 
TTTCTTCCTC 
TTTTGCATTA 
AGGTAAACAA 
ACATCTTGTG 
TTTTTTCATT 
TTTCCATGTG 
CATGCTGCAG 
GGCAGAAAAT 
CTTCCTACAG 
CAGGAGCCCA 
ACATACTGGC 
GGGTAAAGGT 
GTGGCATTTG 
GCAAAATTCT 
CCATGCAGAT 
ACAGTTGACT 
AGTGAACTCC 
AGTCAGAGAG 
TGGAGATGTG 
AGGGGCTGAG 
CAGTCTTACA 
TGGAAGCAGT 
CAACTTGCTT 
TGATGTCTGA 
CTGCTCAGTT 
TACCTACCCT 
TCTTATAGCT 
GGTCTTAATT 
AATTCCTGAT 
ACTTTTAGCT 
ATCAGAAGAA 
GGAACATTGA 
CTGTATAAAA 
AGCTAAGGGA 
AACCAAAGCT 
CAGGGCTTCC 
ACCGTCTTAG 
TTGTAAAGAG 
GAGCATTAAC 
AGGGAGCCTT 
AATCTGCTCT 
TGCTTTTCTT 
GAAATACTAA 
TTCTTAGAAG 
TCCTTGGCTG 
CAGAATGAAA 



GGCTACTGTG 
CTAGCATGCT 
ATAAGAGGAG 
TGCAGCTGCA 
CTTTTGAAGG 
AGTGGGGATA 
TTTCCAATTA 
AAATCTGGCC 
TGATTTCTTT 
ATATATGCAA 
AACCACTATT 
AGCTCATGTC 
TCTTAATCAT 
AATTTTTAAA 
GCAGTTTTAT 
AATTGGAATA 
TTGTTGTATC 
TAAATATTTG 
GATGTGCTAT 
ACTATAAATG 
TAAATTGGTT 

ttcccaaaat 
acagaggact 
gggttaaaaa 
gtctcattgg 
tgttgattaa 
cgaaattgtc 
atctagcaaa 
gaaaatggac 
aatgagctag 
ttaggataag 
tttgaaagca 
atttgaagca 
aagatggagg 
agaggcctct 
actgtgagga 
ttctctccac 
taaacctcgt 
cctatagaac 
t.gtggcgatt 
aatgttaaat 
actagttcta 
cttagagctt 

TGCTATTTTC 
ACTACAGTCT 
AGTATTTGAA 
AGTGGTTATG 
TAAAACAGGA 
GAATTTGACC 
GTACAACCTG 
TTGGCTGATG 
AAGCTAATAT 
GAATCATGAA 
TGCCCAGATC 
TCAGGAAGCA 
TGAAAGATAA 
AATTTAATAC 
AGGGTCAGAG 
CAAGGAAACT 
ACAGCCAAAT 
AATCAGCAAT 
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27351 AAGAGGACAT AAGAATAAGA AATAAGAGCT GTATACCAAG GGAAAGGAAA 
27401 TATTGGTGGT TATATTCCAG GCAACTTATG CTGCCTAATA GAACAGAA&T 
27451 CTAACAATTT TTATTAACAC ATGGCAGAAT CCAAGTTGCT ATCCTGTATT 
27501 ATCTGTGATG TCTACTTTTC AACCA&AACT ACTAGATGTG TCCTATTGCT 
27551 GTACAACAAA TTACTTCAAA ACTTAGCAGT TTAAACAACA ACAAACACCT 
27601 ATTTTTAAAA ATCACATTTA TTTTCTGAGA ATCAGTATCT TAAAGAGGCT 
27651 TAGCTGTGTA GTTCTAGCTC AGAGTCTCAT AACATTGCCA TCAAAATGTT 
27701 TACTGGGGCT GCAGTCATCT GAAGGCATGA CTGGTGCTGC AAAAACCACT 
27751 TCCAAACTCA GTTATATGGT TGTTGGCATG CATGCATTCT CCAGTTTCAT 
27801 TTAATCTCCA GTTTCCTCCA ATGGTGGTAG GCTTGGACAA TAAAGGGCAA 
27851 GTTCTGTACA AAGCCCAGAC ATGGAATATG CAAGATCAGA GCCTAAGTTA 
27901 GTTGTGAACA TCAGGGTGCC TCTGTTTACT TCTTTATAAC ACTTTAAATT 
27951 GTGTGTTTTA ATGGAGATGA AATTAATATA ATATGGGAAT AAAATTAGAT 
28001 TACTATAATA CAAAAATACA CATATATGTA TGTATATTTA TATGTATGTG 
28051 TGTGTAACAT TAGTTGGGAT TATGGAATGT TATAATGTTA AGGGTTTTAA 
28101 GGATTATTAA AAAATTTAAT TAATATTGGC CAGGCACGGA GGCTCACGCC 
28151 TGTAATCCCA GCACTTTGGG AGGCCGAGGC GGGTGGATTA CCTGAGATCA 
28201 GGAGTTTGAG ACTAGCGTGA CCAACATGGG GAAATCCTGT CTCTACTAAA 
28251 AATACAAAAA ATTTAGCTGG GCGTGGTGGT GGGTGCCTGT AATCCCAGCT 
28301 ACTCGGGAGG CTGTAGCAGT AGAATAGCTT GAACCCGGGA AGCGGAGGTT 
28351 GCAGTGAGCT GAGATTGCGC CACTGCACTC CAGCCTGGGT GACAAAGCGA 
28401 GACTCTGTCT CAAAAAAAAT GATAATGATA ATAATAATAA TTTAATATAT 
28451 TGCCAACATT TAATTTAGAA AATCAGTTTG TGAGTCATAT AGTTTAATTT 
28501 TGTAATTTGA ATTCCCATAG TTCTGGCACT TAAATTTCTT GAGTCTGAGG 
28551 TTATGATCTC TGAGTTTACC TCAGAGATCA AGTTCTTAAG AAACTACCCC 
28601 CAATATATTT AGAGGAGAAG AAGGCCTCTG CTATTTACTG TATGCATGTA • 
28651 TATAACTTTG CCGAGTCCTG GTTTCCACCT CTGTAATATT AGAATAATAG 
28701 TATCCACTTA TAAAATTGAT AGGGAGTTTT AGTGAGATAT CATGGGAAAT 
28751 AGCTGTGCAT GTGGAACATG CCAGACTGTA CAAGGAAAAG GTTCAATTAA 
28801 GGCTAGTCTT GTTTCTCAGC CTCCATATCC TTCTCACGGT ATTTGCTTCT 
28851 CAGTATAATT TTTCTTGTTT CTCTATGTTG CACAGTAATA TGAAATGTTT 
28901 TAGGACCATA TCTGATCCTA TTATTCCACA TCTATCTCAA TTATGCATTC 
28951 TTGAGGTTAC CCAGAACCCC TCTCTTAGGA AATGGTGGCA TTAATGGGAT 
29001 ATTAAATAAA GTATGAAATA TTTATTCAGT GGGGGTATTT AAAGCAATTT 
29051 CCAACAGAAG CAGAATGAAT GTTTATTAAT TAACTTCTAA AAATAGTGGC 
29101 TTTTCCATGT GGTACCGCTA GTTCTCATTA CTGCATTCTT GGTATTGCAT 
29151 ACCTAATCCC TGAAAATATT GATGCGATCG GACTTAACTG TCTTTTCTTA 
29201 TTTCTCATCA CCAATGACAT TAGGTTTGAT TACAATCAGG CCGGATTGTC 
29251 CACTCCCAAC CCCTGCGGAG ATCGCTTCCA TTCTGTGTCA CTTCTTAATT 
29301 TCCTAAATCA CATTGTCTCT CGTTTCCTTA CGTGAGTGAG GCACAGGCCA 
29351 CTAGAAGCTT AGGAAACATG TATTTTTTTC ATAAAAAGAC CATTCATTTT 
29401 TTCCCTTTGG ACAATGCTTT TTTCTTCCTC TAGCCTTTCT CATTTGCATA 
29451 CACATGCAAC CACCAAGGTA TGCACAGAGA GTCTCTGTCC TCTGATCTCC 
29501 CCCCAAAAAA ACGTATGAGA AGAAAACACA GGGCAAGTGC CATGAACAAA 
29551 TTTGAAATTT AGCAAGAAAG ACAAGAAATG AAGCAGTTGG TTCCCAATAG 
29601 CCCTTCCCTA CCTTGATAAT GTGAACGTGA ATAAAAGAAA TAAACATGCT 
29651 ATTAGAAAAT GGATATGCCC TCCCTGTTAC CCAGAGTCCT AAAGAAATGA 
29701 AAACTCAACT TTGAGGCTGA GTTTAGATTA TCTTCAAGCT TGTCTTCCTC 
29751 TTGTTTCCTT TTTCCTTTGC ATCTTTGAGA AGGAGTGATT TGTGGTTGTA 
29801 AATAGTTGTC ATAACATCAG AATGGCTCTG ACAAATTTGT CTTCAAAATA 
29851 GGATCCCTCT TCACCTCCCC ACCTGGACTG ACAGTTAAGC CTCAAGTTGC 
29901 CCCTTGTATC CTGACACCTC AGCTCAAGAC AACTCATTTC CTGGAGGGAA 
29951 TACAACCAGA AGATATTGAT TCTGCCTGGG GCAGGCCAGC TGGCCCCATG 
30001 ATCCAGGGTG TCAACAAAAT GACATTTACC CAGGGGACTT TGCCAGAGCT 
30051 CATAATCTAA CTTCCAGCCA TGCAGGCTTA GAGAGTAGCT TTTGTCTCCA 
30101 CACTAGGAGT GGGTGGAATC ACAATAGCCA ACCTGTAGAA AATGATGGAT 
30151 GAGCTGCAAA TGGAAAACGT TCATGGGTAA AAGGGAAACT TAATTAAAAG 
30201 GGAAAGTGTT GAATTTCAAG GATGAAGAAA GAATTATAAA AGCATCACAG 
30251 CATTAAAACA ATAATAACAA TCAACCAGCC AACAATAAAA ACAAATATCT 
30301 AAGAAGGAAA ATAAAAATCT GGACGGCCCT CAGACTTCTT TGCAATGTTA 
30351 GATGCCAGAA GATAATGAAG CAATGATTAC AGTAACGAAG CAATGATTAC 
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30401 AACAGCATGA TGGAAAAGGT TATGCCTCCA CCTTCTATCT TTAGCCAAAC 
30451 TGTCATTCAT CTGAAAACAA CAAAAAAGAA TGCATGAATA TGAAAGGACT 
30501 CAGAAATAAA GCACTTAGGC ATGTTTCTTA AAAACAAAAA TTTTTTTTAA 
30551 TATTTTTTTA AGAAGGAAGA AAATAAATCA ACTATTCAAA ATAAATAAAA 
30601 AAGAACAGCA AATCAAATAT TAGGTTGTGG TGAAAGTAAC TTTTGCACCA 
30651 GCCTAATACA TAGAGATAGA AGGAAAGAAT TAATACAGAT TAAAGCCCAG 
30701 TTTTATAAAT TAGAAAACAA AAAAGAATAC AATTTATATT TAATTGCAAG 
30751 AGTGTATTTT GTGAGAAAAT AGATAAAATA TTAAGTGATT CGAAGAGCTA 
30801 AAAATGCAAA ATTGACAATG CAATAAACAT ATTACATCAA TTATATGAAA 
30851 ATACTTTGTA CAATTCTAGG CTAATTATCT TCAAAGTATC ATTTGAATAT 
30901 ATATATATAT ATTTTGTCAG ATTACTAAAA TTGGTAGAAA TAAGATAGAA 
30951 GATATCACTA TATAAATATC TTTTAAAAAA TCAAAATCTT GTCTGAAGAA 
31001 TTAGCTCCAA GAAAGGTGTT AGTTTAGATG GTTTCCTAAC TGTTTTGTGG, 
31051 TCCTTCAGGG GATAGATAAA GGATGTTGAT TATCTAACTT TACAGAGCAC 
31101 AGGAAAGGAA GTCCTTTGCA ATAATTCTAA TTATTTTTAT ATGGGTAGTG 
31151 TAAGTCTGGT AGCCCAAACT TGAAAAAGTT GGCATACAAT AAAAAACAGA 
31201 TATCTGACTA ATTTTTTATT TATATATGAA GTAAAGATCT TTAATAATAT 
31251 ATTAGAAAAT CAAATCCAGG AGTATATTCA ATGAATAATT CACCATGACT 
31301 AAGTGGGATT TATTCCAGCA AAGAAGGGAT GGCACAACAT TGAGAAATCT 
31351 ATTAATATAA TTCATGATAT TAAGAGGTTA AAGAGGAAAA CATTATTTCA 
31401 CAGTTGTCAA ATATGTATTT GATAAAATTT AAGATTATGC TTTAAAAAAC 
31451 AAGATCTCAG TTCCTTGGAA TTAGATATAT CATTCCTCAA CACAAGGGGC 
31501 ATGGTGGGGG ATCCATCTCC AGCCAACGTC AGTTCCTGGC TCGCCAAAGA 
31551 TACACTGGAT TAGATGAGGT CAGATGAAAT AAAACAATAA TACCTACTGT 
31601 CATCCCTACT ATGTAAGATG ATTTTGGAAG TATTAGTTTA TAAAAGTAGA 
31651 GATTGAAAAA TAACAAACAA AAGAGACATG ATAATTGGAA AAGAAGAAGT 
31701 AAGTTTATTA TTTACAGGAG ATATAATGAG TAACTGTCTG GGAAACCTAA 
31751 CACAATCACC TGAAATCAGC TATTGGAAAA CCTAAGGGAA GTTCAAAAAC 
31801 GTTTCAATGA TTGTTTTACT GTATAGCAAT AATCATTAAC TAGAGAATAA 
31851 ATTGGAAAAC AAACCTCATT TAGAGTAGTA CCCAAAAGAA GGAATTACTG 
31901 GGAAAAAGCT TACCACATCC TAAAGCCATA GTGTGATTCC ACTATGTGGT 
31951 TGGGTTCAGC TTTGGTAAAG GACAGACTGG TTACAACTTC AGATCTTCCT 
32001 TTTATTAGTA GTGTGGCCAG TTAACTAACC CAAACTTGCA TTCTCCTCTG 
32051 TAAAACCAAG AGAATAATGT CTCACGAGAG TTTTATGAAG ATCAAATGAG 
32101 AAAATGTACC CAAAATTCTC AATGAAATGC TAGCACATAA GATTATTATT 
32151 GAGAAAATCC CAAAACTCTC CTGGCACTTG GTAACTAAAT TGATCTCAAG 
32201 AAAAGCTGAT TATCTGCACA AATGTGTATG CATAAAGAAT ATGTACAAGT 
32251 ATGACCTCTG TGGGTTTTTT GTACAGAAAA ATTAGAAACA ATTATATGTC 
32301 AACATAGGTG ATTGACGGAA TACATTTTAT ACATCTAAAA CGTGTGATAT 
32351 TTGTGTTGAT TTAAAAATAC AATATGGCAG TATATTTATT GACCAGTAAA 
32401 GATTGTCATA ATACATCCAG TGAAAAGATG AGCAAAAATA GAATTTATAG 
32451 TCCTATTTTG GTAAAAATTC TGTATATAAT CATATGTATT TAACTGAAAA 
32501 ATAAAAAGGA CTAAGATATA TGCTAAAATA TTAGTAATGA TTTGCTATGG 
32551 CTTATCAGAT TTCTTTCCAC TGATGTTTTG TTTAGAAAAT TGGGGTTTTC 
32601 TAACAATTCT GTAACAAATA TTAATTTCTT ACATTAACAA AAAAAGTAAG 
32651 AAACAGGTTG CTTGAGGTCC TCTCTTGCAC ACTGAAATCA CTTGTCAAAT 
32701 CCTAGAGGCC AAGACCCCCT CTCTCCTGAT AGAGTTTAGG TGGACAGAGC 
32751 TTAGGTTGTC CACACTTATC TCTCAGTCTT TTTTTTTCTT CCAATCTATG 
32801 TTCTCTGTTG GATGAAACCC TACCACCATA TGACTGACAT GTTTCTTTAT 
32851 ATATGTCAAT ATCTTTGGTC AAGAATTGCT TTGACTTGGG AACCGCTCTG 
32901 TCTGTGATGT GATTTGTACA CCTAAGCTCT CACCATTCAT TCAACAAATG 
32951 TTGACCCAGC ATCCAGACAT TTGGCTAGGT ACTCAATATG AGGTGTTTGC 
33001 TGGGGGGCCT AGGGAGACCA ATAGAGAGCA CGGCTCTTTC CTGAGTAGGT 
33051 TCTGAACCAA CGGAGAAGAC AAAAGAGAGG GCAGATGGAG AAACAAAACG 
33101 TGTGTGTTAG TTTCTTACAT GAGGAGCAGC TCAGCTAAGG GAAGTTGGGG 
33151 TCTGAAGAGC GATCTTGCTC TCCTATGTCA TCGTCATTGC CTCTCACGTC 
33201 CCAGTGGAGT GGGAAAGGGG ATCCACATCT CCTGCGCAAG AGTGTGGCTC 
33251 TGGTAGTGCC TCTTGCTTGA GAATCATGGA TAAAATGCTC TCTCTGGATA 
33301 ACCTTTTCTG CCTCTCCTAA AGAAGATTGA GGAAACCGAG TGGAAAAAAA 
33351 TTAGGAGGAT GATGCTAAAA CTCTTTTCTC CCTGTGAAAG GAAAAGGCCA 
33401 GGGCTACGGA TGTCCTCTGT GTGGATGTGA CTGCAGTCTC AAAGGAAATC 
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33451 TGCTCCTAGA TTCAAAGACA TTCCCCCACT CCTAATTACT GCACTTTTCA 
33501 ATATCTTTTG GGACGCCTGT GGGCAGCACT GATTTTACCC CTGGATGATG 
33551 CTGAAATGGG GGAGAAGAGT ATATTAATTA AATTAAATTT GAGGCTACTC 
33601 AATAAGGTTT GGCTGTGTCC CTACCAAAAT CGCATCTTGA ATTGTAGCTC 
33651 CATAATTCCC ATGTGTCATG GGAGAGATCC CATGGGAGGT AATTGAATCA 
33701 TGGGGGTGAA TTTTTCTTGT GCTTTTCTTA CGATAGTGAA TGTCTCATGA 
33751 GAACTGATGG TTTTATAAAG GGGAGTTCCC CTGCACATGC TCTCTTGCTT 
33801 GCCACCATGT AAGATGCACC TTTGCTCCTC CTTTGTCTTC TGCCATGATG 
33851 GTGAGGCCTC CCTAGCCATG TGGAACTGTG AGTCCATTAA ACCTGTTTTC 
33901 TTTGTAAATT ACTCAGTCTC AGGTATGTCT TTATTAGCAG CATGAGAACT 
33951 GACTAATACA CTACTCTATC CCAAAAGCTG AAAATTCATT CTTAGTGGCA 
34001 CTTACCTTTC CAAATTATGT CTAGCCATGT TTTCCAATAT AGCCCACCAG 
34051 AGATAATACA ATAAAATTAA ATGACAACAT TGAGATATTC TCAATAATAA 
34101 AATGTCAAAA ACATGGAGAT ACATCAAATT GAGTATAAAC ATCCTCTTAT 
34151 TATCAAATTC CTGAGGAGAA GTTAGTCCAA AAATTACACT GTTTTGCAAG 
34201 TTCAGGTGCA AAAGAATTAG TCAGAAATTA AATTTGCTTG CTCTTGTGAA 
34251 TTAAAGGCCT AATGGAATCA AAGTTCTCTC CCGTTTCTTA TTTTGTTCAT 
34301 TTTTAGCCAA AGGTGGTCTT TTAGGTGGAA CTGAGGTTCT TTCAGTAGCT 
34351 TTCTTTGGGA GTGATAATGA TCTCTTCAGG ACCTCAGAAG ATGAATTACT 
34401 CTAATGCCAA TCTCCAATGA GCTTCACCTT TAAGACTTCA AATGTATTAG 
34451 GGCATTAATG TTTCCTAGTA TTGACCCCCA TCCCCACGAC ATTTAGAAAG 
34501 CAAGCCAATC CCCAGCCCTG ACAGCAATTG TTTCAGGCAA AGCTTTGTTT 
34551 TCAGAACACA GTGCTGGATG CTGCCTGAGT CCAGGTGGGG ACCCAGTGAT 
34601 TACTAAGGAG GGCGACCAGA AGTGCCAGTT TCCTTGGGAC AGTCCCATTG 
34651 TATGCATCTT CTCCCAAAGT AATGATTGAT AGCAGCCATT TTTACTCTCA 
34701 AAATTGCGCT GTTTGGAAAA CAAATTTTAT GGTCATTCTA CGATGAAAGA 
34751 AATCTAGATT TCAGCTTTGT CAGGAAGGCA CTCACTTACT CCGGTTTTTT 
34801 TGGAGTCTAC CACCTGTGGA TTAAGCTCAA TTGGAAGAAA ATCTTCAAGT 
34851 AGGAGGTTAA TTTGGGTAAT ATGGTCCTAT CATATCCTAG GGTAGGGGGG 
34901 AATCTAGCTA CTTATACTGT GTCCTCTCCC AGAGAGTGAA TTTGATACAC 
34951 TGTTTCCTTC TCATTAGGTT TAGAAATTTG TGGTGGTGGT AAATCCAATC 
35001 ACTGGGTCTT ACTTGAGGTT CCTTTCTTTC CCTTCAAATA TTATTTTCAC 
35051 TTGTAAATAT TCTTTCCCCG TTGCCCTGAG TACAAGCATT TAGTATTAAC 
35101 AGTATAAATT TAGGCATTAC CTTTGTAATG TCTGTAAAGC AGAACATGAC 
35151 TTATTTGCAT TTTAAGAGCA GCAAATATTG TTGGCTCATA ATGTCTATCT 
35201 GCATGACCTG TACTGGGAAA GATTATTCTC CATTGCTGAC TTTTTTGTTT 
35251 TGTTTTGCAG GGTCGATAAG ATTTATTTTG TTTTAATTTT AATTTCAATA 
35301 GTTTTTGGGG GAACAGGTGG TGTTTTGTTA CACGGATAAG TTCTTTAGTG 
35351 GTGATTTCTT GGATTTTGGT GCAACTGTCA CCTGAGCATT GTACACTGAC 
35401 TTTAATGTGT AGTCTTTTAT CCCTCATTGG CCTCCGACAC TTCCCCCCAA 
35451 GTCCCCAGAG TCCATTATAT CATTCTTATG CTTTGCATCC TCATAGCTTA 
35501 GCTCCCACTT ATAAGTGAGA ACATACAATG TTTGGTTTTC CATTCCTGAG 
35551 TGACTTCACT TAGAATAATA GTCTACAACT CCATCCAGGT TGCTATGAAT 
35601 GCCATTATTT CATTCCTTTT TATGGCTGAG TAGTATTCCT TGGTGTATGT 
35651 GTGTGCATGT ATATATATAT ATATATATAC ATATGTATAT AAAATAAAAT 
35701 ATAAAAATAT ATATATATCT CATTTTTTTC TTTTTTTTTC CTTCAACTTT 
35751 TATTTAAGAT TTGGGGTATA TGTGCAGGAT GTGCAGGTTT GTTACATAAG 
35801 TAAATGTGTG CGTTTGCTGC ACAGATCTAT CCATCACCTA GGAATTAAGC 
35851 CCAGCATTCA TTATCTATTC TTCCTGGTGC TCTCCCTCCC TCCGTACCCC 
35901 TCAATGGGCC CCAGTGTGTG TTGTTCCCCT CCCTGTGTCC ATGTGTTCTC 
35951 ATTGTTCAGC TTGTGTTTAA AAAAGATAAC ATGTGGTGTT TGGTTGTCTG 
36001 TTCCTGTACT AATCTGCTGA GGATAACGAC TTCCAGCTCC ATCCATGTCC 
36051 CTGCAAAGGA AATAATCTCA TTTTTTTGTT TTTTTGTTTT TTGTTGTTGT 
36101 TTGTTGTTGT TGTTGTTGTT TTTTTTCATG GCAGCACAGT ATCCCTTGGT 
36151 GTATATGTAC CATATTTTCT TTATCCAGTT TATCACTGGT GGGCATTTTG 
36201 CTTGATTCCA TTTCTTTGCT GTTGTGAATA GTGCTGCAAC AAACATATGC 
36251 ATGCGTGTAT CTTTATAATA GAATGTTTGA TATTCCTTTG GGTATATACT 
36301 CAGTAATGGG ATTGCTGGGT CAAATGGTAT TTCTGCCTCT AGGTCTTTGA 
363 51 CGAATTGCAA CACTATCTTC CGTAATGGTT GAACTAATTT ACATTCCCAT 
36401 CAACAGGGTA GTAAAAGTGT TCGTATTTCT CCACAACCTC GCCAGCATCA 
36451 GTTGTTTTTT GACTTTTTAA TAATCACCAT TCTGACTGGC ATGAGATGGT 
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36501 ATGGTATGTT ATTGTGGTTT TGATTTGCAT TTCTCTAATG ATCAGTGATG 
36551 TTGAGCTTTC TTTCATATGT TTGTTGGCTG CATGTTATGT CTTCTTTTGA 
36601 GAAGTGTCTG TTCATGTCCT TTGCCCACTT TGTAATGGGG TCGTTTCTTT 
36651 CTTTCTTGTA AGTTTGTTTA AGTTTTTTGT TTCTTGTAAA TTTGCTTTTG 
36701 ACTCTGGATA TTAGACCTTA ATCAAATGGA TAGATTGCAA AATTTTTTTC 
36751 CCATTCTGTA GGTTGTCTGT TCACTGTGAT GACAGTTTCT TTTGCTGTGC 
36801 AGAAGCTCTT TAGTTTTATT AGATCCCGTT TGTCGATTTT TGCTTTTGTT 
36851 GCTATTGCTT TTGGCATTTT CGTCATGAAA TGTTTGCCTG TGCCTATGTC 
36901 CTGAATGATA TTGCCTAGAT TTTCTTCTGG GGTTTTTATA GTTTTGGGTT 
36951 TTACATGTAA GTCTTTAATC CATCTTGAAT TAATTTTTGT GTAAGGTGTA 
37001 AGGAAGTGGT CCAGTTTTCA TTGTTCTGCA TACGGCTAGC CAGTTCACCA 
37 051 TTTATTAAAT AAGGAATCAT TTCCCCATTG CTTGTTTTTG TCATTTTTGT 
37101 GGAAGATCAG ATGGTGGTAG GTGTGCAGTC TTATTTCTGT GTTCTCTCTT 
37151 CTGTTCTGTT GGTCTATATA CCACATTTTC TTTATCCACT CATTGGTTGA 
37201 TAGGCATTTA GCCTGGTTTC ATATTTTTGG AATTACGAAT TGTGCTGCTA 
37251 TAAATAGACG TGTGCAAGTG TCTTTTTCTT TTCTTTTCTT CTTTACTTTT 
37301 CCTTTCTTTT CTCCTCCTCT CCTCTCCTCT CCTCTTTCTT TTCTTTGACA 
37351 GAGTTTCACT CTTCTTGCCC AGGCTGGAGT GCAATGGCGC ATCTTGACTC 
37401 ACTGCAGCTG CTGCCTCCCA GGTTCAAGTG ATTCTCCTGA CTCAGCCTCG 
37451 TGAGTAGCTG GGATTACAGG CATGTGCCAC CACGCCTGGC TATTTTTTTT 
37501 TGTATTTTTA GTAGTGACGG GGTTTCTCCA TGTTGGTCAG TTTGGTCTCC 
37551 AACTCCCTAC CTCAGGTGAT CCACCTGCCT CGGCCTCCCA AAGTGCTGGA 
37601 ATTACAGGTG TGAGCCACCG TGCCCAGCCA CAAGTGTCTT TCTCATACAA 
37651 TGACTTCCTC TGTGTAGATA CCCAGTAGTA GGATTGCTGG ATTAAATGGT 
37701 AGTTCTGCCT TTAGATCTTT GAGGAAGCTC CATACTGTTT TCCACAGTGG 
37751 TTGTACTAGT TTACATTCTC ACCAGCAGTG TAAAAGTGTT CCCTTTTCAC 
37801 CATGTCCATA CAAACATCTA TTATTTCTTG ATTTTTAAAT TATGGCCATC 
37851 TTTGCAGGAG TAAGGAGGTA ATTAGTGATG TTGAGCATTT GTCCATATGT 
37901 TTGTTGGCCA TTTGTATATC TTCTTTTGAG AACTGTCTCT TCATGTCCTT 
37951 AGCCCACTTT TTTTATGGAA TTGTTTGTTG TTTCCTTGCT GATTTGTTTG 
38001 AGTTCCTTGT ACATTCTGGA TATTAGTCCT TTGTTGGGCG CATAGTTTGT 
38051 GAGTACTTTT TTCCCAGTCT GTGGGTTGTC TGTTTACTCT GCTGATTTTT 
38101 TCTTTTGCTG TGCAGAAGCT TTTTATTTTA ATTAAGTCCC ATCTATTTAT 
38151 CTTTGTTTTT GTTGCATTTG CCTTGGGGTT TTTGGTCATG AACTCTTTGC 
38201 CTAAGTCAAT GTCTAGAAGA GTTTCTCCAA TGTTATCTTC TAGAATTTAT 
38251 ATGGTTTCAC ATGTCCTTCA ATAGCTCAGT TGGTAGAGTG GAGAACTATA 
38301 GGTGTGTGGA GCTAATTTGT ATGGTTTCAG GTCTTAAATT TAAATCTCAG 
38351 ATCCACGTTG AGTTAAATTT TGTACAAGAT GAGAGTTGAG GATCCAGTTT 
38401 CATTCTTCTA TATGTGACTT GCCAATTATT TCAGCACCAT TTGTTGAATA 
38451 GGGTGTCCTT TGCCCACTTT ATGTTTTTGT TTGCTTTGTT GAAGATCAGT 
38501 TGGCTGTAAG TATTTGGCTT AATTTCTGGG TTTTTTACTC TGTTCCATTA 
38551 TTTTACATGC CCACATTTAT AAAAGTACCA CGCTGTTTTG GTACCTTTGA 
38601 CCTTATAGTA TAGTTTGAAG TCAGGTAATG AGATGCCTCC AGGTTTGTTC 
38651 TTTTTGCTTA CTCTTGCTTT GGCTATGCGG GCTCTTTTTT GGTTTCATAT 
38701 GACTATTAGT ATTGTTTTTT TCTAGTTCTG TGAAAAATGA TGATGATGTT 
38751 TTGATGGGAA TTGCACTGAA TTTATAGATT GCTTTTGGCA GTGTGAATGA 
38801 AAATATCTTG GGCCCCCCAT ATCACTAAGC TAAAAGGAAA ACTCAAGTTG 
38851 AAAACTGTGC AGGGTAAAAC CTGCCTCTCC ATTCTATTCG AAGTTGTCTC 
38901 TCTGCTTACT GAGATAGATG CATGTTCTAA TTACCTCCTT TGGAAAGGCT 
38951 TATCAGAAAC TCAAAAGAAT GCAACCAGTT TTCTCTCACC TGCTTGTGAC 
39001 CTGGAAGTCC CCTAACTGCT TCGAGTTGTC CCCACCTTTC TGGGTGGAAC 
39051 CAACATCCTT CTTACATGTA TTGATTGATG TCTCATGTCT ATCTAAAATG 
39101 TATAAAACCA AGCTGGGCCC TGACCACCTT GGGCACATGT TATCAGGACT 
39151 TCCTGAGGCT GTGTGTCACA GGCATGTGTC CTCAACCTTG GCAAAATAAA 
39201 CTTTCTAAAT TAACTGAGAC CTGTCTCAGA TTTTCTGGGT TTACAGCAGT 
39251 ATGGTCATTT TCACAGTATT GATTCTACCC ATCCACGAAT ATGGGATGTG 
39301 TTTCCATTTG TTTGTGTTGT CTATGATTTC TTTCAGCAGG TTTGTAGTTT 
39351 TCCTTGTAAA GGTCTTTTAC CTCCTGGTTA GGTATATTCC TAAGTATTTT 
39401 ATTTTTTTGC AGCTATTGTA AAAGAGGTTG AGTTCTTGAT TTGGTTCTCA 
39451 GCTTGGTCAC TGCTGGTGTA TAGTAGTGCT ACTGATTTTG TATCCTGAAA 
39501 CTTTACCAGA TTCATTTATC AGTTCTAGTA GCTGTTTGGA TGAGTCTTTA 
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39551 GGGTTTTCTA GGTATACAAT CATATCACTT GCAAACAGCT ACAGTTTGAC 
39601 TTCCTCTTTT CTGATTTGAA TGCCCTTTAT TTCTTTCTCT TGTCTGTTTG 
39651 GTCTTGCTAG GACTTTCAAT ACCACATTGA ATAGAAGTGG TGAAAGTAGA 
397 01 ATCCTTGTCT TGTTCCAGTT CTCAGGGGGA ATGCTTTCAA CCTTTCCTCA 
39751 TTCAGCATAA TGGTGACTGT GGGTTTGTCA TAGATGGCTT TTATTACCTT 
39801 GAGGTACGTC CCTTCTATGC CAAATTTGCT GAGGGTTTTC ATCATAAAGG 
39851 GATACAGAAT TTTGCCAAAT GTTTTTTCTG CATCTACTGA GATGATCACA 
39901 CAAATTTTTT AAATTATGTT TACATGATGT ATTACATTTA TTGACTTGCA 
39951 TATGTTAAAC CATTTCCGTA TCCCTCACAT GAAACCTACT TGATCATGGT 
40001 GTATTATCTT TATGATATGC TGCTGAATTT GGTTAGCTAG TATTTTGTTG' 
40051 AGGATTTTTG CATCTATGTT CATCAGGGAT ATTGGTCTGT AGTTTTCCTT 
40101 TTTTTGTTAT GTCCTTTCCT GGTTTTGGTA TTAGGGTAAT ACTGGTTTCA 
40151 TAGAGTGATT TGGGAAGAAC TCCCTCTTTC TCTAGCTTCT GGAATAGCTT 
40201 CAGTAGGATT ACTACCAATT CTTCTTTGAA TGTCTGATAG AATTCAGCTG 
40251 TGAATCCATC TGGTCCTTTT TTTTCTTGGC AATTTTTAAA TTACTGTTTC 
40301 ACTCTCGTTA CTTATTATTA TTATATTTTG CATTTCTCTA AGTCTGTCTT 
40351 TCATTTCCAG AAGTTGTAAT TGTCTTTTCT TTTCTTTTTT TTTTTTTGAG 
40401 ACGGAGTTTC GCTCTGTCGC CCAGGCTGGA GTGCAGTGGC GCGATCTCGA 
40451 CTCACTGCAA CCTCCGCCTC CCGGGTTCAC GCCATTCTCC TGCCTCAGCC 
40501 TCCCGTGTAG CTGGGACTAC AGGCGCGTGC CACCACGCCC GGCTAATTTT 
40551 TTTTTTTTTG TATTTTTAGT ACAGATGGGG TTTCACCGTG TTAGCCAGGA 
40601 TGGTCTTGAT CTCCTGACCT CGTGATCCAC CCGTCTCGGC CTCCCAAAGT 
40651 GCTGGGATTA CAGGCGTGAG CCACCACGCC CGGCCGTCTT TTCTTTATAA 
40701 TATCTGTTTC TCTGGAGACT TTTTCATACA TACCCTGTAT TGTTGCTTTT 
40751 TTAAAAATTT CTTTATGTTT GTTTTCACCT CTCTCAGGTA CCTCCTTGAG 
40801 TAGCTTAATA ATCAACCTTC TGAATTCTTT ATCTGACAAA TCAGAGATTT 
40851 CTTCTTGGTT TGAATCCATT GCTGGAGAGC TAGTTGATCT TTTGCAGGTG 
40901 TTATAGAACC TTGTTTTGTC ATATTACCAG AATTACTTTT CTGGTTCCTT 
40951 ATCATTTGGT TAGACTGTTT CAGTGGTAAG ATCTGGAACT CAAAGGCTGC 
41001 TGTCCAGATT CTTTTGTCCC ATGAGGTGAT GCCTTGATGT GGTGCTCTCT 
41051 CCTTTCCCCT AGAGATGGGG TTTCCTAAGA GCTGAACTGC AGTGATCGTT 
41101 ATTGCCCTTC TGGGTGTAAC CACCCAGTGG GGCTATCAGG CCCTGGGCTG 
41151 GTGCTGGAGG ATGTCTGCAA AGTGTCCTGT TATGTGATCT GTCTTCAGGT 
412 01 CCCCGAGCTA TGGATATTAG CACCTGCTCC AGTGGAGGTG GCAGAGGAGT 
41251 GAAGTGGACT CTGTGGGAGT CCTTGGTTGT AGTTTTGTTT AGTGCACTGG 
41301 ' TTTTCTTGAA TGCTGATTAT TCTAGCAGTG AAGTTGTCAC ATGGACAAGA 
41351 TTCAGGACCT CTGGTTAGCC AGGATGTTGC GGATGGTGGA ATTAGCTGTT 
41401 TGTTTTCTCC TTATTTGGAG CAGGGTTTTT CTGTTATTAG TTGCTGTAAT 
41451 GGCTTGACTT GGTTGGCCTC CAGCCAGGAG GTGGCACTTT CAAGAGAATA 
41501 CCAGCTGTAG TCGTAGAAGG GGGATATAAG CTTACCCTAT GTTGGCCAGG 
41551 ATAAGTACCC AGGTTTCTCA GGCGATGATA GGTGGAGCCA TAGAGCTCCC 
41601 AAGAGTTTAT GTATTTTGTC TTTGGCTACC GGGGCAGGTA GACAAAAACC 
41651 ATGAGGTTGG GGCAGGGTTA AGCGTGTCTG AGCTCAGACT CTCCTTGGGT 
41701 GGGGCTTGCT GCAGCCACTA TGGAGGATGA TGGGGTGGTT CTCAGGCCAA 
41751 GGGAGTGTTG TTCCCAGGGG GATTATGGCT ACCTCTGCTG CATCATATAG 
41801 GTTGTCAGGG AAGTGAGGGA GAGCTGGCAG TGACAGGTCT CACTTGGCTC 
41851 CCACACAGTC AGCAAGGCTG GTCTCACTCC TGCCATGCCC CACCAACAGC 
41901 ACCAAGTTAA TAACCAGGCA GCTGGTGAAC AGGGCTGAGA TCTTGCCCCA 
41951 GGCTGTAAGC CTTCCCACTC AGAAAGCAAG CCGGGTTCAC AGGCCTTGCC 
42001 CTTCCCCACC TGCCCACATC ATTGGCAGGG GCTTCTGTGT TCATATCTGA 
42051 ACTTCTCATT CATCCCTCCA GATTCTGCTC AGGAAAATTA GTGCTCAGTC 
42101 TAAATTATTA CAAAGTTCAT CTAGGAGCTT CCTTCACTCT GTGGCCCATC 
42151 CCCAATTCTG ACTACCTTCC CTTCCCCAAG AACCCCTGTG AGAGAAAGCC 
42201 AGGAATGGCT TCCCTGGGAT GGAGCTGGGG ACTGGGAGTG CCTACAGGGT 
42251 TCTTCCTGCT GATTCTTCTA CTTTTATATT TTGCTTGACT CCCTAAATCC 
42301 ACTTCAGCTC TAGATAAGCT TAATCCTTCT CCTGTGATCT GGAGTTTCAG 
42351 GTCCCCAATG GAGATATGTG CTCAGAGGCA GACTTTTCCC CCATCTCACA 
42401 CTTCAGGAAC TCAGTTTTTC AGTTGTCTTG TGGAGTTTGC AGCAGCAAGC 
42451 TGCTTCTTTC AAAGGGTCTG TGAATTCTTT TGGCTTTTCT AATATGTTCT 
42501 CATGGTGGTT CTTGGAGCAA AAGTTCACAA TGTAAGTTTC CGAACACTGT 
42551 TCTGTCCATC CAAGTGGGAG CTGCATGTTA GTCCTGTCTC CTATCTGCCA 
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42601 TTTTACTTGA TCCATTGCTG ACfTTTTGTT GTACATGTGA CTGTGTAGGA 
42651 AAATTTGGGC CAGTGTAGCT CAGGAAAAGC TGTAGCAGCT ACACCCTGAG 
42701 CTCATGGAAC TTGCTAAAGG TCTAGCACTC AGCCATATTT TGCTAGACCC 
42751 ACAAGGCCAG CGTCTGTGGA GATTGTCTTT TGAGTCACTG CTTGGATAAG 
42801 CACCTCTCTA CCCTCACCTG CCTGGTTTAA AGTCTTGCTT TTATAGCTAT 
42851 GCCTTTCCCA AAACAGTTCT CATCTGTGGA CCAACCCACC TTAAGCTTCA 
42901 GACACCCAGT AAGGTGGAAA GGCCAAGGTC CCTGGCTCCC AGCTCCAGCA 
42951 GAGGTGACAC AGTAAGAAGT CAAATGCTTA ATGGTGTGCT GGTAAATACT 
43001 TAAAAACCAG CTCTCTATGG TAGGGACAGG AACGGGAGTC CCCAAACCTG 
43051 ATTTATAGTG TTTGCCAATT TCTGTGGTTG ATTTTAGTTC ATGCTTTTTG 
43101 TGTCCTGTCT GATAAATTTT TGCCTAATCC AAGGTAACAA ATACTTCTAT 
43151 ATTTTATTCT AATTTTTTTT * TTTTAGTTTT CAATGTTCAG TTATTCCAGC 
43201 ACCATTTGTT AAAAAGAATA ' TCTTTTCTCC ATTGACCTGC TGTTTACTTT 
43251 TATCAAAAAC CAATCACCCA TATATGCATG GGCCGACTTC TGTGAATGCT 
43301 ATTTTGTTAC ATTGATCTCT ATGTCTACCT TCATGCCAAC ACCACACTCA 
43351 TGATTACTGT AGCTATATAG TAAACATTTG AAACCAACTA GTAAGAATTC 
43401 TCCAAATTTG CTTTTTTATG TCCAAATTGT TTAGACACTT GTAGGTCTTT 
43451 TGTTTTTCTA CATTATTTTA TAATCTACGT GTGTTCATTC CTTTTTCTGC 
43501 TCATTTCTCA ACATCCCAGG AATTTGGAGT TGAAACTCAT CTTCTCCTAA 
43551 AATGCCAGAC AAGGCCTGGT CTTGTGATTT GTTGCCTGGA GGAGAGTAAT 
43601 TGTGAACTCC CAACTAACGT TGGGAGCAGT ATCCCAAGAA AGAGAGAGAG 
43651 GAAGGGCTGG TTTTAAGTCA TGTGGACTTC TAGATCAATT TCTAGAGTTA 
43701 GGTAGATTTA CACAATGCAC TGGATCAGGC AGGCCTTCCT CCACTTCCAG 
43751 AGGACAGAGG AATTCATCTT GTTCCAAAGA GACAGAGAGA GGAAGGGCCT 
43801 CAGTTCTGCA TGCTCTTTTA CCATCTTCCC TCAAACTCAC TAACCACTTG 
43851 .TCTCCTGAGG GGGTGAGGAA TGGGTGAGAA CAAGAAAGTA TAATAATGTC 
43901 ACTGCCATGA CATGGAAGAG CATGTCAGGA GTGGGGAAGC TGCATTCTCT 
43951 CCACCAGCAC TGACTCTACC CATTAGCATT GAGTGGTCAG CCTTATTTCC 
44001 CATTTCCATG TGCACAGGGA AACCTAGCTA GGTACTGTCA CTCACCTAGC 
44051 TAGGTTCCCC CGAGCACATG GAAATGGGAA ATAAGGCTGC TCTGTAGAAA 
44101 ATTTTATGTG . ATCTAGATTC AGCTGACTTC ATAAGTCAAT GCCACAGAAA 
44151 AGGAGTTGCA TAATTTTTTA AGGAGTCTTT TTTTTTTTAA ATAAAGCCTA 
44201 GCTATAAAAT GGCAATAAAA TGTCAGGATT AATTTTTTGA ACATATACAC 
44251 CAACTTCTAT TTCTGAAAGA GTAGTCATGG TGGGTTCATC CTCTTATCCC 
44301 AAATAAGTTT TGTTTCTTGA AGCACATCTT TTCCCTGAGA ATATCTCCCC 
44351 CAGTTATTTC TGGCCTCTGG TGGGTGCCTG ACATGTCTCT GCTCCAGCTC 
44401 CATTGGTGCT TTACCATTTC TCTGTGAAGA TGCCAAACCT GGGTGAAGGG 
44451 CTCTGTCCTT GGGTGCCCCA CAGCTGATCT TCCTGGTACA GGATTCCCCA 
44501' TAGCATAGGG ACAGTGTCTA CCTTGAACTT TTTCTTCTTG GGCCCCGAGG 
44551 CAACCTTTCT CTTATGGGGT TCCCAAAGAT TTTGTTTGTT TAAGTCAGCT 
44601 CTTTCAATTT CAGAGTTACC TCTGTTTTAC ACAAGCTCTC CACACACTTT 
44651 TCTCTTCTCT CAATACAACT CAGCTCTCCT CAACAAAATC CCTAATTGTC 
44701 ATGGACAGGT TCTCTCCAAA TAGAGCCTGC AGGTTCACCG GCACATTAGT 
44751 TTCTAAAAAT ACTTTCTCTT TATGCTTGGA TGAGACACTC TGCTATAATT 
44801 TCCCATCTTT TCCTACTCTC AAAATAGAAA ACAACTAATG AACTATAAAT 
44851 GTGTTGATTT GTTGACAATT TATTTTTCTA TGGCAGTGTA CTCTCACTAA 
44901 GCATGGTACC TCAACAAGAC ATTTGCATTT TATGCCTCAT CGCAAATTGA 
44951 AATAAAAGAT CGTTTCTTGT TCTTATGTCT TTAAGTGGCC TATGGGTACA 
45001 GCTGTCTGCC AAAAAACCCT GGAAAAGGCT CATTCTCTGC TGATGTAGGT 
45051 TTTGGCTTCA ACTTCTTTCT GCAACATGAG AAATTTTGGG GGCCTCAGAC 
45101 ATAGGAAAGA AAAAGAATTA ATATTTGGGA GATATTTACT GGAGATTGAG 
45151 AAATTACAGC CTATTATTTA GTGCTTACCA AGTGCCAGGC ACTGAGTTAA 
45201 GTGCTTTATA ATCAATTTCT CCTTTCCTGC TTCATTTTTT CTTCATAGAA 
45251 CCTATCGCTC TCTCAAGTTA TACTACATAT ATTTATGTAT TTATTCATTC 
45301 TTTTTTTTGT TGTCTGTCTT CCCCTACAAG AGTATAAGCT CCATGAGGAC 
45351 AGGGATTGTT GTCTTTTTAA TTATCTGCTA TATTCCTAGT TCCTAGCTCA 
45401 AGGGAGTCGC TCGAGATATT TGTTGCATTA GTAAATCTCT TATCCCCTAC 
45451 AGCAAGCTGT ATGATTATTT TCATGTATGA GAAAATAAGG GCTTAAACAG 
45501 GTTTTGTAAC TTTTCCAAGG CCGTCAATTT GGTAAGTGGT AGAACCAAGA 
45551 ATCAAAGCCA GGATTCAACT CATGAGTCCA TAGTTGTACC ATTTATCTAA 
45601 ATTACCTCCC ATCTCTTTCA ATCCTGACAG CAATACGCTC TAAAGTAGAT 
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48701 TGTTTGTTTA TTCATTTATT TATTGATTTA AAAATGAAGT TTTTTTTCTG 
48751 CTGTTTGATG ACAGTGCAAC CTATTCTTAT CTCAAATAAT TAAAATGTAC 
48801 TCTCTTTTTT GATGTTTGGA TCTGAGCACT AATTTTCATC CAATTTAGAG 
48851 TGGCTAATTG CAAAATAAGA GAGGATGATG TGATAAGTTT TTGTGGTATT 
48901 TAAGATGAAT AACACTTGTT TATTTTTAGT GACATGATGA ACAAAGAATT 
48951 ATTCAAGATG TGCCCTAGAT TTCCGCTGGT AGCCATTACC CTATGAACAG 
49001 GGTACCCCGA TGTGTTTTAT TTAAAAGCTT CACATTGGAC CTCATGATCA 
49051 CGCACTAGAT TCATGGTTTA TTAACCCACA TTAACTGACC GACTGATAAG 
49101 TTAAACATTG TGGAAAGGAT TCACTCCCAG CATCTCCCCA AGGTGCAGCC 
49151 ATATATGCTG GATACAGATA CCCCTTGTTT TTCTTGTTTA TATTTATAAC 
49201 CCAGGTACTT AGGCACTGAG ATTTCTGTCT GTTTTTCCTT TCACTTTTTC 
49251 AAGGCATAAG CTATCTTCTT AGAACCTCCT TCCTATTCTC TGGTCAAGGT 
49301 TTGAATAATA CCCACTTGTT TCTAGAAATG TCCCTCATAC ACACTTCCAT 
49351 GCTTGGAGCA GCTGGTAGCT TTTTACAACA GTTGATTCTT CCCATTCAAT 
49401 ACTGAGCATA TGGGTTGGAG TCTCCTTCTA AAAACTGAGC GTCTGGGTTT 
49451 TCAGAAGTTC CAGGAAATCT GTAATGTAAG GCAGAAGCAC AAACTCTTTG 
49501 TGAAGAATGA AGGAAATAAA TGGATCTTGT ATAAGAAGTT TCAGGAAATA 
49551 TCTGTATGGA TGAAGCATTT GAGGAAAAAA GATGATACTC TTGAAATAAG 
49601 ATTTTATTTT AAGCCCATAG AAAATCTAAA GATAGTGTTA AAGTTCTGGA 
49651 AGGGTTTTGA TGAGAATTGT CTGATGGGTC TCATCATATT GCATAATGAA 
49701 TTAAACAAGT TTTGCACCAA CTTCATTTAT TGACATACAT AGTCCTTAAG 
49751 ACTCCTTCTC AAACTCAGTG AGTAAGAACT TCTTTATCTT TAGATGGCTT 
49801 TGTTGACATG ATTATCTAAC CCTGTGGTTT CACTTGGTCT TCTTTACCTT 
49851 ACATACATCC TTGCATGTCA ACTGTCTTCC ACTTTCCTGG GTTCATTTCA 
49901 TCACCCTGAG TCTCACAGGA CTTGTTAAAA CTTCTTGAAT ACTTCCAAAG 
49951 AATTCATTGA TATTAAAATA TATTTAAATT ATTAATAACC AAACCATGTG 
50001 GTTCTGGAAT AGTAATAAAC TGTGTGTTAG TCTGCTCTCA CATTGCTATA 
50051 AAGAAATATC TGAGACTACA TAATTTATAA AGAAAAGAGG TTTAATTGAC 
50101 TCACATTTCT GCAGGCTGTA GAAGAAGCAT GATGCTGGCA TCGGCTTGGC 
50151 TTCTGGGGAG GCCTCAGGAA ACTTACAATC ATGGTGGAAG GTGAAGGAGC 
50201 AGGCACATCA CATGGCCAGA GCCAGAGCAA GAGAGTCAGG AGGGAGGTGC 
50251 CACATGCTTT TAAATGACCA GATCTCATGA GAACTGACTC ACTAACATGA 
50301 GAATAGTACC AGGGGATGGC TCTAAACAAT TCCTGAGAAA ACTGCCCCCC 
50351 TAATCCAGTT ACCTCCCACC AGGCCTCACC TCCAGCATTG AGGATTACAA 
50401 TTCGACATGA GATTTGGGCA AGGACACACA TCCAAACTAT ATTAAATTGA 
50451 TAGATAAATG AAACAAAGCA CAGATTCTAG AAAATGTTCC AAGAATATTT 
50501 GGGAATATAG CATTTAAGTC AGTGGAGAAA AGATGGATTA TTCAACAACT 
50551 AACGTTGGAG TAAGTGGCTA GACACTGAAA AGAAGCCAAG GCCTTAAAAT 
50601 AATTATAGAT GAATCAGTAG TTCAAATTTT AGAGTTATAC CATATAAATA 
50651 CTCTATGAAA ATATGGATGT TTTGGTGATT TGAGAGCTGA AGGTTTTTTA 
50701 AGGTCTAACA TCCCAGCTAG AAACCACAGA AGAAAAAAAT AATTGATTGA 
50751 CCACTTAAAA TGATAAACTT TTATATAGAA AATATGTCAT TAGGCAAAGT 
50801 TTGAAGAATG AGAAAGTGAG GAACATATTT GTACTGTAGG TCAGGATTAA 
50851 TATCTCTAAA ATATATATTT TAAAACTCCT ACAAAGCATT GAATAAAAGA 
50901 TTAATATTGC CATATAGAAA AAATGTGCAA AGGACATAAA CAGATAATTC 
50951 ACAATAGGAT CAATAAAAAT GGCCAATAAA CATGAAAATA TATATAACAT 
51001 TCCAGATATT CTGCAAAAAC AACAGCATTT TTTTCCCTCA TTTATTTGAT 
51051 TTGCACACAA CAATCTTGCT AACACAACAC GTAAATAAAA TATAGGGAAA 
51101 TGGGCTGGGT GTGGTGGCTC ACATCTATAA TCCCAGCACT TTGGGAGGCC 
51151 GAGGTGGGCG GATCACGAGA TCAGGAGATT GAGACCATCC TGGCTAACAC 
51201 GGTGAAACCC CATCTCTACT AAAAATACAT TAGCCGGGCG TGGTGGTGGG 
51251 TGCCTGTAGT CCCAGCTACT CAGGAGGCTG AGGCAGGAGA CTCGCTTGAA 
51301 CCTGGGAGGT GGAGATTGCA GTGAGCCGAG ATCACGTCAC TGCACTCCAG 
51351 CCTGGGTGAC AGAGTGAGAC TTTGTCTCAA AAAAAAAAAA- AAAAC ACAC A 
51401 CACACAAAAC AAAAAAACAG GAAAATGGAC ACACTTATGC ACTGTGGTAC 
51451 AATGTTTCTC AAGGGCTGTT GGTAAAAGGC ACCAAAAATA GAAATTCAAC 
51501 TTCTAGGAAA TTATTTAAAG GAAGGAAATG AGCTTGTGTG TCAAGAGATA 
51551 AATGTTAATG ATCATTGTAG TTTTGCTTGT TATATTGGCA AAGAAAGTAA 
51601 AAAGATTCAA GTTCAGCATC CTAATACTGT TCCTAATATG GAAACGAGCA 
51651 TGACTTATAT TTCTTATAGA ATAGTATGCA GCTGTTAATT AGGGTGTCAG 
51701 TCTACCTTAA TTGTAATAGG AAGACAGCCA CAATGTTATT GAATGAAAAA 
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51751 GTTGACTACC CGATGCCATG TATAACATGA TACCATTTAT TTAAAAGTGT 
51801 GCACATTTAC ATTTATGAAC TCAATATGAT CCATCACATG GGCTTTTGCT 
51851 ATTCAAAATA AAATATCATA TCACCTTTTT AAAATTTCAA ATTAAAAGGC 
51901 CCAAGTCACC AAGTTGCCTC TTTTCTCTTT CTAGGGCTTC CTCTCATGAT 
51951 AGTCCATGAT GGTCTCAGCA AGTTATCAAG GTGCTGTCCA TAGGGTCAGT 
52001 TCAGACTACT TTTGGCTCAT AGCTGGAGCC. AGATATTAGC CACATTCCCT 
52051 TGATGCTGGA GATCACCACA AATACCACTT AAACCTGGTT CCTTTGAATA 
52101 GCATCATCAG TTACCCAGCC AAAGACCTGG AGGTTGCAAC TTCACTCTGC 
52151 CTGTTTGACA TGTCCAAGTC CACAGCTGAA GAACAGCATA ATTCTCATCA 
52201 AGTCCGAATG GCATGACCAA GTCTGAGCCT CTGTTTCTGG TAGTCACTCC 
52251 TGGCCCTAGA GCAGATTTTT TCAAGAAAGC ACAAGTGGTC CTTTGATGAA 
52301 CGAGAGAGAA ATATTACTGT GACCCAACTG TGTACACCTG TCAAACTGGG 
52351 TCTGCTCTCT CAATGCTTCA GCTTTACAGA TTGTACCACT TCCGTGCCCC 
52401 CGACCCTAGC CCCAGCTGCT TTCCCAGGGG GCAAGACAGG TTTGATGGAT 
52451 TCCAGCGTGT ACTAATCTTA GGGCATAGTG CATGGAAATG CCCACCCATC 
52501 TTGTCTTTCT CTATCTCTAT GGGGATTTCT TTACAGAGAG GTCTTTTTGA 
52551 TCAAACTGGA TCTACCCAAT ACATTTCTGT ATCTTTTAAT ATTAATGAAA 
52601 ATGGCTGGGC TTTGGAACTT CTCCTCCAGA AGGTCTGTGT ACTCCATTTG 
52651 TAGTGTTTTT CTTCATATAT CTCAGCAATA TTCAAACTCA GAGTACCAGC 
52701 AGTATGTTTG GAAAACAGCA AATTAAAATT TAATTTTGCT TGACACCACC 
52751 TCTCCCAATA GTATTTTTGG AAAACAGAAA ATTAAAATTT AGTTTTGCTT 
52801 GACACCTACC TCTTCCACTG GGAAAAAAAT GGGTGTTACA GGGTGAGAGA 
52851 CAATAGAAAT TATTTAGGAT GCATTTTCAG CAACATTTCA ACTTGGCTAT 
52901 TTCCAGGCAA TAAGATTTAA GGTGATTTGT CTATAATTTT TTCCTTCATT 
52951 TTAGTTTGGA AAAGAGCATG TAACATTTTA AAAAGAAGGT AAAAAGCTAT 
53001 TTATTTTTGG AAAAAAAAAT TTCTCAAATC TCTTTGTGCA AACTCTTGCT 
53051 ATTAGAGCTT GGACAAATTA TTTAATGTTT GTAGGCCCCC TTAACCTATA 
53101 TGTGAAATGG AAGAGTATGA TTACTTACCT TTAGTCGAAA AATGTTCTCT 
53151 TATTTTCCTA AAAGAAATAA TTCACAATAG TACAGAGGAG GTGACACACT 
53201 GTGTTGTGTT TTCATTTTAA AAATGAGAAG TTGGTGAGAT AGTCTTTAAG 
53251 GTCTATCTCT GCTTTAACAT TCAGTGAGCT TTGGAAAAAT TAATGAC TTA 
53301 GAAGACACAC GAGACAACTC TTGGAAGAGG TTGGCGCCGC TAACTCTATT 
53351 GACTTAAATG ATCACATTTA GTTGGAGCTC CTTCAGAATG AGATCTCTTA 
53401 TTTACCAATT GCAGAGATTT ATTTCCTTTA TTTGTCTAGT TAAAAGGCAC 
53451 TGGGAAAAAT GAGGCCAGTC CCTCACTCTT CAAGGGAGCT CTTATTTCCC 
53501 TGCACTGCCT ATTAGACTGG GGATAGCTGA AAGTTTTTCA TTGCTTTTAC 
53551 TTAGGACACG TTTGTAAGGC ATATTTTACG AAGGTTGCTT CAACAGACTT 
53601 TTCAACCCCC TCAGCAACCA TTCGCTCATT CTGCTCTTTT GCTTAACATT 
53651 GTCCTTTTTT GCCCTCATGA TCTGAGACCA CATTTGACTG AGATGAGTTG 
53701 ATTTGTCTGG CATTCCTCTT CTCTGTTTCA CTCATATTCT CTCTCAGGAC 
53751 TTTCTTTCCA TGATAATTTG TTGGGGTCAT TCTTTAGGTT TGTAATTTTC 
53801 AACAAATGTC ATTCAAACTG TTTTGTCTTA TTCATGGATT TGCCTCTAAC 
53851 ATGCATATAA TTACCTATTA AAGCATAGCA TTTGAACTAT TCTAAAGCAA 
53901 AAGATCTTGT AAATCACTGT AGTACTGTTA GGATTGGAAC TCAGAAATCA 
53951 AGTCTCAAGG GGAAACACGT TGCTTGGTTA GGATAGTTGT CCATCCTGCC 
54001 AGAATTCAGA TGTAGCTGCT ATGGAACTGA ATGTTTGTGC CTCTCCCCAC 
54051 TCAATTCATT GTTGAAATCT TAAAGCTCCA AAGTGATGCT GTTAAGATGA 
54101 GGGGCTTTAT GGGAGGTC3AA TAGGTCATGA GGGTGGAGCA TTCATGAATG 
54151 GAATTGGTGT CCCTGTAAAA GAGACCCCAG AAACCTCTCT TGTTCCTTCT 
54201 GAGATGCGGT GACACAGCAA AAAGACAGAC ATCATTTATG AACCAGGAAG 
54251 TAGGCCCTCA CCGGACACTG AATCTGGCGG CACCTTAATC TTGGGCTTCC 
54301 CAGCCTCCAG AACTGTGAGA AATACATTTT TGTTTATAAG CCACACAGTC 
54351 TTTTGTGTAT ATTCTGTGAT TGCAGCCTGA AAGGATTAAG TCAGGAGCTA 
54401 AATCTGGATC CCAATTCTGT GCTCTTTTCT ACCTCTATTC TCCTCTCCTC 
54451 CCTCTCTTCC TTCTTTTTTC CTCTAAATTA TTTAACTTAA AACACTTTAA 
54501 ATCACAAATG ACTAGAATAA AAATGTGAAA TCCCCCATCA GAGAATTCCA 
54551 CCCTCACCCT CTCCTTTTTA GTTGTAGCCA CTATTTTGAT TTGTCTTAAT 
54601 AACCATTTTT TATATGCGGG AGATTTATGT ATATATGATA TACATATATA 
54651 TGTCTCATGT GGAGTTTTTC TTTTTTGTGT GAAAATGAAT TGTGATAAAT 
54701 GCATAATTCT GTGACTTTTT AAAAAATGTA ATAAGCTCTG GATATGTTTT 
54751 TGGTTAAGTA GATCTAGTTC TACATTATCT TTAATCCAAC CGTGATAAGT 



WO 02/086113 



19/219 



IPCT/GB02/01887 



54801 GAGGACGATA ATATATTTAA CCATTTTTCT ATTAATGATC ATTCAGCCCC 
54851 TTTTAAATAT TCTTTATAAA AACAATGCTA CAATAAATAT CTTTTAAACA 
54901 TATGTGCAAG TATTTTCTCA CAATCGTGCA AGATCTAGTT TATGTACTTT 
54951 GTCAAATGTA CTTGGTCATA GTTTTCTGCT GGGCGTGAAA GAAGATAGCC 
55001 TGGCACTGAC TACATTCTTA TTGGCCCCAG AGGGGAGCTT GATCAATGCC 
55051 AAACAGGGGA TATGAGGGAG CTGGCTGATA AATTACTCTC CCTCCACCCG 
55101 ATAAGTTGTT ATGAGACACA GTGTTTCCAA TGCCCCTTCA GGAGATGGGC 
55151 TGTGTAGGTG AGCAACCAGA TATGGTTTCT AATGAGACCA CAACCAAATT 
55201 AATTATATAA CATTTTGTAT TTTATTTTTG CCTTTCTTTG CCTCATTTTC 
55251 TTTTTTCCTT ACTCTAGCTT CCTTGCGATT ATACTCCTCA TAAGACTTTC 
553 01 TATGTAAATA TTGTCTCAGG CTTTGTTTTC TAGAAAAGGC GGGCTTAGAA 
55351 AAAGATATAT ATTGAAAAGA CAAATTATTA GGTCAGAGGG TATGCGAGTT 
55401 TGGGCCACCC TACTTTTTAA TCAGGTACAT GCAATTACAA ATATGTAGTT 
55451 ATCTTTTCTT TTATTTTCTA TTAATTATCA ATATTTCAAT TATGAAAAGA 
55501 ACTATATTAT TATTATATAC ATTATATAAA GTAATATATT GTTACATTAT 
55551 GAGCATTCAA TCTTAGTGCA GAACCTCTCT GTCTGGATGT ATATTCAGGT 
55601 AAAATGTTAC ACGAATTCCT GTTAAGTGAT GAGAATTCCA GTGCTTTAAT 
55651 TTTGCAGCTA AATAAAGCAA AGCCCAAAGC CGGAATTAAT TCTTTATTCT 
55701 TCTCTATATT ACATCATAAA GCAGTTATTC CTCCCACAGA AGCCATCTCT 
55751 CAGAAAATAT GAAGTGGTTC AAAAAAACAT CATTAGGACA CCCTACATTA 
55801 CATGTTGCTG TTGGGAAGAT TTTATACAGC ATAAGCTTTC ATAAGTTCAT 
55851 AATGGGGAGA TAACTTGATT AACCTTTTGC CCAAGTTTTG CCATAATATT 
55901 GAAACAACAG AGAAATTAAA GAATATATCT ATATTTTTTT CTATTTGGTT 
55951 TTCCAGGAAA AAGTGGTTGT AGGTTAAGTG AGAAGTGGTT AGCATCTAGA 
56001 ACAAATAGGA TTTCTCACAA CTTGGGGAAC TGAATCTAAT CTCTGGCTTG 
56051 TCAACTTGCC AAGGTGTTCT TAATACCATC CCCAGAAATC CTAATTCAAA 
56101 CTCACATTTT GGCTTCTGGC GACTGCCCGG CAACAGCGAA CATGCAGCTC 
56151 ACTCCACCAC ATCCATTCTG TGGGAGTCCC TGTAGGACTG TATTTTCAAA 
56201 TGAGCATGTA ATTCACAATA CTCCTTCATC GTGAGAATGT GGGGGAACAA 
56251 CAGATAGCCC TTAAATCATG GATTGGGAGC TAGAGTGTAA CTTGCTAGGT 
56301 CAGGCACACC CCTTCATATA CTCATCCTGT ATTGTTGGCC ATTCATATGG 
56351 AGATAGGTTC ACAGAGGAAA TTTCTATTGC CCTGTTAATG AAGGCCCTCT 
56401 TTACATCTTT TCATCGATTT ACTGTGGCTG TTCTGACCTG GCTGCACTGA 
56451 CAGGACCTTG AGGCTTAATC ATGAGTAGAA AGAAGAGCCT GAATTTTCCT 
56501 GGAGAGTTTC CAAGGGAGAT GCTTGTAGCA ACAAATTCCT ATTTAGATCA 
56551 GGAAATTCGA GAAAGTAGGT GTTAAGTAGA TATATGAGAG TTGTGTTGGC 
56601 TTTCCATATT TGCTTAAAGT GGGAAAATGG ATTGTGTGGC GATTTGGATA 
56651 ACTCCTGAAA CCAATGTGCT CAGCCAAATG TAAATGTGGG GTTGCAACAC 
56701 TGATGTGAGA GGAAAGGAAA TTGTTGATGG AGTGGAAATG CTTCAGCTGT 
56751 GCTTAGATTG CCAGTGAGCC AGATACACCT GTGTTGGCTT GGCGTGAGGA 
56801 CTGCATGGAT TTGCGAAATG AAAACCTCAC TTACATTCTA ACCAGATTAA 
56851 AGACACTGAT GAAGTCAGGC CAGTTTTACT CTGCAATTAG GAAAACTGAG 
56901 GCTCTCTTCA GTAGTCATCT CTTGAAACAC ACTGCTTTGC AGTQCTTTGT 
56951 TGAAGAAGTG GTATTTATCT AACGCCTATC AGCCTGGAAA CCGTATCTTG 
57001 AATCCAGTGT TGCCACTCCT CAAATTGCAG CTTCCTGGTG GCAACTGAAC 
57051 AAGCAAGTTC ATACAGCAAA GGGGAGAAAA CTAAAAGGTC ATAGCATCCC 
57101 TTCCATGATT AGCATTCTGT CTTTGACCAC CTCTAGTTTC TGAAATAAAC 
57151 AGAAACTTTG TAGTAAAATC TTAAGAAAAT GTTTTAATTT TTTTCAAAGT 
57201 AACTCATATA AGTAGTTGCA GAAAATAGAA AAGTATGGAA AATCTTATGA 
57251 TGGAAAACAA CATTCTTCTT TGAGGTGTTA AGGTCTGTTA ACTATTACAG 
57301 AATAAATTAA TATATAATAT GTAACTTGGC TGTATGTATT TTCAGCACTG 
57351 TTGGTATGGT TTGGCTCTCT GTGTCCCCAC CCAAATCTCA CCTTGACTTA 
57401 TAATAATCCC CACGTGTCAA GGGCAGGACC AGGTGGAGAA AATTGAATCA 
57451 TGGGCGTGGC TTCCTCCCAT GCTGTGATAG TGAGTGAGTT CTCATAAGAG 
57501 CCGTTGGTTT TATACGGGGC TTCCGCCTTC ACTCAGCGCT CATTTTCTCT 
57551 CCTGCTGCCC TGTGAAGAGG TACCCATCCC TGAGATTGTA AGTTTCCTGA 
57601 GGCCTCCTCA GCCACGCAGA CTGTGAGTCA ATTAAACCTC TTTACTTTAT 
57651 AAATTACCCA GCCTCGGGTA TTTCTTCATA GCAGCGTGAG AATGGACTAA 
57701 TACAACTGTA TTTTAGGAAA CGTGCTACAT TAAAGATGGC TGCTAATTGA 
57751 TGTGCGTGTG TGTGTCTCTC TGTGTGTGTG TGTGTGTTGG CGGGGTCTAT 
57801 GTCTATAACT CGTCTCTTTA AATTTGGGTG GCTCTGTGAG TGTTCTGACC 
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57851 AAATGAATAT GCAGACATGA TTCTGCTCCA GTTTTATGGC TTAGGCTTGA 
57901 GGAGACTATA AGCTGCTTCT TCCTGTCTCT TGGATATTTA CTTGTGGAAC 
57951 TCAGTTTCCA TGCTGTGAGA AAACTCAATT ATTCCAGTGA AGAGTCTCAG 
58001 GTAAAAAGAA CTAAGGCCCT CAACTGAGGA CAGCCTCAGC TGAGCTTGCA 
58051 GCCAACAGCC AGCACTAACT TGTGATCCTG AGTCAGCTAT CTTGGAAGTG 
58101 GATACTCCAG TTCCATGTAA ACTGCCCAGC TAATACCACG TGGAGAAGGA 
58151 GAATTGCCTA GCCATGCCTG CCCAAATTCA CGACTTGCAA AATTGTGTCA 
58201 TATAATAAAA TAGTGGCTGT GTTAAGCATT GGATTTTGGG GTAGTTGTTA 
58251 TGCAGAACAA GAAGTTAGCT ATAATGGAAG CATACAGAAC TTGCAGACTT 
583 01 AGTGCCCTGA TTTCAAATTC TGGTTCTAAC ATTTCATAGC TATGTAACTT 
58351 TGGGGACAGC TATTAAGCCA CTGTCAGCCT TAAAGAAAAG TTTCCTTATC 
58401 TGTAAAATGG CAGTAAAAAT ATATCTCTTA AAGGCTGTGG TGAGGATTAG 
58451 AGACAAAGTG TGTAATATGA CTATTACATT TTCTGGAAAA TGATAACAGC 
58501 TTGACATAGT AGCAATTATT TGCTATTCAC CAGTTAACCC ACTTCTCTGT 
58551 ACCTTCTTCT TTTTAAGCAG ATCTTCCAGA CCAGGGTATT GCTGAGGAGA 
58601 ATATCTTTTT TGAAAACCAA CTTAGTGGAG TAATTTAAAG ATCTACAATG 
58651 TTGATATATC TTTCATTAAG TAATTTTATT TTATTACTGT ATTCTAAGGA 
58701 AATATTTCTA AATACTGGCC ATAGTACGTC TATCCAGGAA ATAAGATGCT 
58751 TATGTATGTT CCTGATCTCT ATGCATCGCT TTAAAAAATT CTGCATGTTC 
58801 ATTATTTTGA AGTGCATCAC TGTGTTTTAG ACACACCAAC CAAGGTGTTG 
58851 GAAGGCAATG GCTGAATCTG CCTCCTCAAG ATAACACCAT CCCCAACATA 
58901 CTTAGGAACT ATCAACAACA ATTAGGGAGG TTATGTATAT ATGAAATAAT 
58951 TCAATGAGGA AGTGTAACTT TTGAGAAGTG CCTTTTATTC TACAAATATT 
59001 TATTGAATAG CAACAAAGTT CTAGATTCTG GGCACAATCC AGGAGAACAG 
59051 AGATGAAACC ACAGCCTTTG TGCTTTTCTA GAGAGTCTGA TGAGTTGCCG 
59101 TTCTGGGCAG AGTCAAGAC* GCTTCTCAAA CATTCTATAT GTCTCAAACA 
59151 TGTCTTCACC TCCATGCTTC ' ATTGTTAGTC TTTCTTCTCA AAACCATCTT 
59201 CCACTTTATC AGTCAAAACA TTTTGCATGA CTCAATTTGA GCTCCACCTC 
59251 CTCCAAGCAA TCTTCCTGGA TCTGTGCTAT ACAACTTCTA GCTCCAATCT 
59301 GTCCCCTGTT GCTGACTTCT TGTAGAACTC AATGTGTTCT CGTTTCTGGC 
59351 CACTCTGTGT GGCTAGTTTT CTTTTTTAAA TGTCTGTATT ATTTCCACAA 
59401 CTAGATGAAA AATTCTTCAT CAGTGCATCT CAAACATGTT TCTTATGCAC 
59451 ATATGAGATA GTTAGGGGTC TTGTTAAAAT GCAGATTCTG ATTCTGTAGA 
59501 TTTGGGGCAA AACCTTAGAT TCTTCACAGT TAACAAGCTC CAGATGAAGT 
59551 CCCTGCTGCT AGATCTCAGA CTACAGCTTG AGTAGCAAGT ACCTAGGTTG 
59601 TAGAGGGCAG AGACCACCCT GTTACTCTGT TTCTTACAGT CCCTGGGGCA 
59651 TGAAAGAGTT CAGTGATTGT TTTGGTGGTA TAGTACACAG AAAAGTATCG 
59701 CAGACTGGTA AAGTTTGATA 0?TTGTCCCCT CTAAATCTCA TGCTGAAATT 
59751 TGATTCCCAG TGTTGGAGGT GGGGTCTAAT GGAAGATGTC TGGTTCATGA 
59801 GGGTGGATCC CTCATGAGTA GTTTGGTACT- GTATTCCCAG TAATGAGTTC 
59851 TAACTCTGTT AACTCCCTCA AGAGTTTCCT TCTCTCCACC CATGAGAACT 
59901 GATTGTTAAA AATATCCTGG CACCTCCCTT- TCTCTTCCTT TCTTTTTCAC 
59951 CACGTGATGC TACTTCCCTT TGCCTTTTGC TATGAGTGGA AGCAGGTTGA 
60001 GGTCCTCAGC AGAAGCAGAT GCTGGTGCCA TGCTTCTTGT ACAGTCTGCA 
60051 GAACCATGAG CCAAACTTAT TTTATTTACA AATTACCCAG CCTCAGGTAT 
60101 ATTCTTTATT GCGATGCAAA TGGATTAAGA CACAGGCCTT GGAACAATTT 
60151 TGACCTAGGT TACAATATCA GTTTTACTAC TTAATGATCT GGATGATTGT 
60201 GAGCATTTTA TTCACTTATA AAGCACACTT ACAGAGTTTT TGTGAAGTCA 
60251 AAAGTAACAT ATATTTTAAA GCACTTGGCA TAGCAGTCAT GGATAGCAAG 
60301 GGACTGACAT ATAAGATATA TTTCCTGTTA ATGGCACTGA ACAAAATCCA 
60351 TGTGTTTTCA GGCTAAAATA TTTCTTCACT AGAGTTGATC TTGATATTGT 
60401 GCAATAAATA TCTTAAGAGG AAAGAAGCTA CTAATATAAG CTTTGGCTTG 
60451 AGTTTTAAAT TTTATTATAC ATATATTTTA TCAGTTCTAA TATATATACC 
60501 AAAAAATGTA GAGAAAAAAT CATGCCGAGT GAAAATGGGC CAAGTGTAGC 
60551 CACATGTTGC TTCTAGTCTC TTATAGTATA AAGCTTCCTT GAAGACAATT 
60601 AAAGCTGATC AATAATAATT AAGGAGACAG TAATGTCACT CTTCCTGCTA 
60651 CAATGGTTTC TACCCAGCCT GTTTGTCAGA GATAGTGTCT AGGTGTTCTG 
60701 CAGATAAAGC CCTGACAAAG ACCCTCAAGA AGATAACTCA GGGTGAATAC 
60751 AAGAAGGGTT GGCTGCAAGC ATTCAGTGCT CAAACAGCTG CATTGGGAAA 
60801 GGCACTGATT GCTGAAGATA GCCCCGTGCT AGACCTCCTG AGTCAGCTCT 
60851 TCTTGTCCAC TGGATGAAAA AGTGGTTGCA ATATCTCCCA GGTTCTGACA 



WO 02/086113 » •cT/GB02/01887 

21/219 - 

60901 GTTTGTATTT CAAATGTGAG AGATTAGTAA TTTTTCTTTC ATGGATTACA 
60951 TGTCAGAGTA GCAAAATAAT TGGTAGAGGA AAGTTATCTT TTGAAAAGAA 
61001 TCACAGCTAA TAAATGAGAA GAAATGATAG AGAAAACCAC ATTTGCAAAA 
61051 CTTCATATGT CTAGATAATG GTCATTAATA GTTGCTAAAA CTGTTAGATA 
61101 AGAAACACTA GATGTTAAAT TTTATAATAG ATGGATCAGG CTGACATGAC 
61151 CTGAATTCAC TAATTAATCT TTCCATTACT AAAAGAGGAA CAAACGGGAC 

612 01 ATTATTTTTC TTTTGATGTA AGGCAATAGA AAGCATACAG AGACACTAAA 
61251 CAGTGTTTAA AAAACGACAT ATCTAGTCAG GCCTCTAGAT CTAACTCCCA 

613 01 GTTTACAAAA AATATGAGGA TGGAGGGTCA TGTTAAATGA CATCACAATG 
61351 CTTTGATCAG TGAAACCTAG ATATTCCACA AGACAAAGGA TTTAGTTGTT 
61401 TAAAGAAAAA GAACAAATAG CACACAAAGA GGAAGAATTA TAATATTTTA 
61451 AGAGAGACTT GTGATACACA TAATCTAAAT GTAATGTGTG AACCTTGTTT 
61501 GGATCCTGAT TCAAACAAAC TAACTTTATT GTTAATTTCA TAATACCTCT 
61551 ACTGAAGTCA TTTTTTAAAA AGTTCTATCT GTTGAGATAC TTACTGGAGT 
61601 ATCTATTGAT AACAGATGCC CACCTTTACT AGGAATCTGT GACATACAGA 
61651 TTCAAAGAAA TAAAATATCT CCCCTGTTAG AGTAGCCAAA ATTAAAATGT 
617 01 TTGTGCTGAC AATTGGAAGC AAAAGTGGGA GTGAATAAAA ACTCATGCAT 
61751 TTGGTGAGTG TTCAAATAGT AGAATCATTT TGGAGAGCAA TTTTGTGGTA' 
61801 TTTAGCAAAG TTGAAATTGT ATGTTCTTTA TGTCAAAAGT TTCAGGAATT 
61851 TCCAGAATTC CCTTTCTAGT TCTGTGAGTC AGCTGATTTA GGCTGGTTGG 
61901 ACAACTTCAT CTCCTGCTTA TGGATCTGGC TAGGCTAGGT TTCTTATTTA 
61951 GATTTGGACT CAGGCCTATC CCGTGTTGTT ATATTTTTTA AAAAAATTGA 
62001 TAGACTTCAT AGTTTTAGAT TGACAGAAAT AATACCAAGA CTTAACATAA 
62051 ACCACTCTCC TCCTGCATAA TTTCCCCTAG TATTATATTA GTACAGTACA 
62101 TTTGGTACAG ATATTGAATC AATATTGATA TGTTGCTATT AATTGATATC 
62151 CATAATTTTT CTCGATTTCT TTAGGTTTTA TGTATTATCT ATTTTCTGTC 
62201 CCAGGATCCC ATCTGAGATA TCATAGTACA TTTAGTTGTT ATGTCTTTTT 
62251 AGTCTCCTAC AGGCTGTGAC AGTTTCTTAG GCTTCCCTTG TTTTCAATGA 
62301 CCTTGACAGT CTTGAGGAGT ACTGGTCAGG TATTAATATT TTGGAGGATG 
62351 CCCTTATATT AAAATCTTTC CAATGTTTTT GTCATTATTA GATGGAGATT 
62401 ATGGATTTTG GTACACAGGA TCACAGAAAT AAAGTGCAAT TTTCATCAAA 
62451 TGATATCAAG GGTACATACT ATCAACATGA CTTATGATTG TTGATCGTAT 
62501 AATTGTTTAC CTTAATCATC TGCTGAAGCA GTGCTTGTCA GGTTTCTTTA 
62551 CTATAAATTT GCTGACTCCA GCTTTCCATA CTGTAGGCTT TTCCTAGATT 
62601 CCACATGGTA CAGTATGTGC AGCCTGCACT TGAGTGGGGA GTTATGCACT 
62 651 TCCTCCTTTA GGATAGAGTA TCTGTGTGAC TTATTTGGAA TTCTTCTGCA 
62701 TGGCAGATTT TTCTCTTCTC TCCGACTTGT TAATTTATTC AATCATTTAT 

.62751 TTATATGAGT ATAGGCTCTT TGATTTTTAA AATATCTGGG GTTACAATTC 
62801 AAGACTATTT ATTTCATTGT TTATATTGTT CCAATTTTGA CTATTTCCTT 
62851 TTGTCTCTTG TGTCCCTTTG ACATATCTCT GTGTGTGTGT GTGTGTGTGT 
62901 GTGTGTGTGT GTGTGTGCAT TTTTCTCAAG CATTTCTTAA TGGGCTGCAA 
62951 GATGCTCCAG GCTCATTTTG TATATTTTGT GCCCCCACCC TAGAATCAGT 
63001 CGTTTGTCTA AGCAGCCTTG TTTCTTTATT GAAAAATAGT ATTAGAAAAC 

.63051 AAGATCTGAC TGCAGTGTGT GCTCAGTGCT CCTACAGTGT CACTTCTTAT 
63101 GGGCTTGCTT GGCTGACAGA GCAAAGAAAT ATTTACGTAT ACTAATCCAT 
63151 ATTTGTCCAA TTATCTATAA ACATTTCTAT ATATAAAATG AAGTGCATCT 
63201 ATATTAAAAA TTAAACATAA GTTGTTACAA TTAGGTACAA TTAATAAATC 
63251 ATTAATTGGT ATGTCTCTAA CTCTAATTCA TTACCACATG GGTCACTCTA 
63301 GATAAGTTTT CCTTTCTTGT TTGGAATCTT CCGTTTAACA ATGAAAACCA 
63351 GCATCCACCA TCCATTTACG TAATTGTTCA TTATTAGTGT ACACATACAA 
63401 CAGTGTCAGA ATTGTTAATC CATACCACAT GGGAAACAAT TTTATTAGCC 
63451 AGAGTACAGT GTCTAGGTGC AGTTACTTTC ACCTTTAATC TTACAAATTC 
63501 CACTCATTTC CAAAGCTACT TTCTTCATCA CGTTTTCTCC CTAACCCTCT 
63551 TCAGTGAGTC TGTTTCATAC ATTTGTAACA CAGTTAAATT CCTCTGTCAC 
63601 ATTCTGCATT CTTTCCTGGG ATCCCCCAAC CTCCTAAATA ATTTAACAAA 
63651 TTTGTTATAC ATGAAGGTTC ATTCTTTGTG TTGTAAAATT CTATGAATTT 
63701 TGATAAATGT ATAATGTCAT GTGCCTTATT ACAGTATTAT ACAAAATAGT 
63751 TTTACCATCC TAAAAATTCC CTGTGCTTCT CTCCTCCTCT TTCTCTAAAC 
63801 .CCCTAGCAAG CAATGAATGT TTTACAGTCT CTATAGTTTT GTCCTTTCCA 
63851 GGATGTCATA TAATTGGAAT TCTAAACTAT GTAGCCTTTT TGTACTGGCT 
63901 TCTTTCACTT AGCATTATGC ATTTAGGATT CCTCTATGTC CTTTGTAGTT 
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63951 TGATAGTATG TTTCTTTTTT ACCAAAGAGT AGCATTCCAT TGTATGAGGT 
64001 ACCACAGTTT GTTTATCCAC TCACCTATTG AAGGGCGTAT TAGCTGTTTT 
64051 CAGTTTGGGG TGATTATGAA TAAAAGTACT ATAAACCTTT GTGTGCACAT 
64101 TTTTGTGTAG ATATGCATTT TCAACTCATT AGTATCAACA ACTAGAAGCA 
64151 ATTGGTGGAT TGCATGGTAA GACTATGTTA TCTATGTAAG AAACTACTAA 

642 01 ATTGTCTTCC AAAGTGGCTG TAGCCTTTTG CATTATCACC AGCAATGAAG 
64251 ATGACAGTTC CCATTGCTCT GCATTCTTGC AAACATTTGG TATTGCCAAT 

643 01 TTTTTTGAGC TATGCTAATA GGTGTTTAGA GATATCTCAT TGTTGTTTTA 
643 51 ATTTTCAGTT TCCTATTGAT GAAATATATT GAGAATCCTT TTATTAGGTG 
64401 ATTTGTCACT TGTGGATCTT TGTTGAGATG TGTCTTCAGA TCTTTTGCTC 
64451 ACTTAAAAAT TTTTTTTAAT TAAATGTTGA GTTTTTAATA CAGTTTAGAT 
64501 ATAAGTCCTT TATCAGGCAT GTGTTTTGCA AATATTTTCT CCCAGTCTGT 
64551 GTGGCTTGTC TTCTCTGTCT CTTAACACTC ATTTTATTTT TACTTTTGGG 
64601 ACTAAAGATG AGGGATGATC AGCTTCCTGT TCTTCTTGTA ATAATAATGG 
64651 CAGAGCCAAG AGAGAATGCA TCAACCTTGT AATCCCATTG GAAGACCCTG 
647 01 ATTGTGTCAA GTCCGCTAAT GTTACATGGA GTAAACAAAG TCACATGGTC 
64751 AAATCCTACA TTAATGAAAT GGGGTAGTAT ATGCAGTTAQ GGGATGGGGA 
64801 ATAATAAATG CTTTTTAAGA ATAATATAAT CTGCCATGCC ATCTATGAGC 
64851 TGATACTAGC TTTCTATTAA TTTATTCTGT GGTTTGTTTA ACGTTGGAAT 
64901 CTCTCTGAAA GATCCTTCAA CCTTCCTGGG AGACTCTAAT CTGAGAAAGC 
64951 CATTAAAAAC GCGTTTCTTT TTTTTGGCAT AATTTAAATT CTTAGACATC 
65001 GTTCACATTT ATTAATGTTA CATACATAAG AATTTACCCA CCATATTCCT 
65051 CCTTGAGTCC AGTTCAGTAT AAAAAAAATG TTGACTTCTC CCCAGTATCT 
65101 CTTCATGGTT TACAGCATGA AATTCTGTTC TATAACCATT TTTCACAATT 
65151 GTCTAACAAT GTTATAATAG ATTGGTTGCT TACTACCAAT TTTTAAAAAA 
65201 GTCATGGCTT CTCCAGTCAC TTCTTGCTTA TCAAAATTAT TTCATGAGGT 
65251 GGTTTAGGCC TTGCATATTT GAATATGGGG TTTGCTCCTT TCTACTTAAA 
65301 AAACAACTTG GATCACACTG TCTTGTCCTG ATAACTTGGT AGAAGTTGCT 
65351 TCATCATTGT TATTCTTGTT GTGGTGGTGG TTATTAATAT TGTTTCATAT 
65401 TCTATTTTAA GAATTAAAAT TTTAAAGAAG ATCAATGTCT TTGAGTATCT 
65451 TCCAAATTCC AGTACATTTT TAAAAAAAGT AACTAATGTT ATCAGCTTTG 
65501 TATGTTTAAA AAGAAGTAAC TAATGCTATC GGCTTTGTAT GTTATCACTC 
65551 AAGACTATTT TATATGTCTT TAGTTATATA CGTGTTTGTA TACTGTATAT 
65601 ATATTATTCC ATTTCTTCTA GAAGAATTGA TTTTTGAACA GAACTTTTTC 
65651 TTTTCCCAGA TGGACCCTCC TTAGGAATCT TTTGTTGATC CTTGAAGTTT 
65701 GGTAACATCT TCAAATGTGT ATATCTCCCT GTTAATTATT TTATGGCATT 
65751 TTCCTGGACA TTTTTATACA TATGTATACT TAAAATGAAA GAATAATGTT 
65801 CCAAAATATG GTAGAAGAAA AAATTCTTTT CTTTCAGAGG TTTCCAACAC 
65851 ATTTTATTTT GCAGATAACT GATTTTAAGC TTTTGAGGAC CAACATCTCT 
65901 ATCAATTCCT ATAAAATGTC CAATTAGTTT CAGTTACATA GCAGGCTGTT 
65951 CAGTACTGCA CACCATACTA TGGCTGGTGG TCTGGTCACA CATGTATAAA 
66001 GGCTGCCCAC TGGTTCCTGG AGCACGGCGT GTCCAAGTGA ACGCCTCTTC 
66051 CAGGGAGCTT CGTTTTCCCA GGGGTGGTTT CCTTTGGCAA GAAGTTGGTC 
66101 AGTTTACACA TGTGGCTTTC GTGCCTGTGA GCAGGAAGTG TCTGAAACTT 
66151 GCAGGATGCT CTGCTTTTTC TCTTATGAAG ACAGACACCT GAGAATCCAG 
66201 TGCCCTTATG CTTAATGACA GGAATTCCTC CTCATTGTTT AGTAGGTTAG 
66251 GATATAAGAA AGCCTCTTTT TAAGTAACAC AGATAACCTT TTTAGATTAC 
66301 TTATTTATTC TAGAACTATA AAATTCAAGA AAATATAGGT CCCATGAAAG 
66351 ATTTAAGAGT TTTAAAAAAC TAAAATCTAG TTTCTCCCAG CAAATAGTAC 
66401 TTCAAGCAAA ACCCTTCCAA TGTTTCTAAA ATAATACTGA AAGGGCATCT 
66451 GATTATACCA GAGCAATTTA AAGAAGTATA TATTTATTTG AATAATAAGC 
66501 TTACATTTTA GCTGCAATTT TGGCAATGCA GATTTTTAAC ATGGATCACA 
66551 GAAAATGTGC ACAAAAAAAG TACTAGTGCA AAGGACTAAA CAATGCTAAG 
66601 AATTAGACTA AATACAGCTG CTGCATTTAA GAAAACAAAA GGQCTGAAAT 
66651 CACTATAAAA ATATAAAATG TATTAAACAC TACCGTCCAT AGAACAGTCT 
66701 TTATTATTAA TTATGTTTAA AATTATTTGC ATAGTTAATT ATACATTGAA 
66751 TTGTAAATGA GTATTATACA TGAACCTCCT TTTTGAAGGG AATTCTTTGT 
66801 TGCATTATGA AATGTCTAAT GACATTATAT TGCAAAAAGT ATCCTTTTTT 
66851 TCTATTGATA AAGGAAACCA TTAATGGTAT TACTGTAAAT ACAAGGAAGG 
66901 CTACAGAAAA AGAAATACAA TTTCATTTTT TCAAAGTGTT TTTCATGCAG 
66951 TGAAGTACTT GCTGTGTTGA ACTGAATGTC CTGTGTCTAA GATGCTCTTG 
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67001 AATGACAAGA CAAGGCTTTT CAATTCAAAG ACTAGAAGTA ATCATGTAAC 
67051 CCTCTTATGA CTGGGATACC GTCGTGTGCC ACTCACCAAC GCTGTCTTTC 
67101 CAGAAAACCA TTCAAGACAC TTAAAAAACC ATGACTTACC TGATAAATAA 
67151 ACATAAAATG AAAAGACATC AACTGCCATT TGACATTGCT ATTGGTAATG 
67201 TCTGTCCTAT GTGAAAGCAC TTTTAAAAAG AGAGCACCTA TGTGTAAGAG 
67251 ATAAGTGTCT AGAGATTCAA AATGAATCAA CAGTATCAGG TAACAATATA 
67301 ATTCTCAACT CAGAAGCTGA CTCAAGATTA GGGGCAATTT CAGTTAATGT 
67351 AACAGGAAAA AAAGACAATG GATTTTATTT TGTTGATGTA TCCACTCACA 
67401 AATTGACTTA GGGTCACCAG ATGTGTAGAC ACAATGAGAT TTTTATTGTT 
67451 TATAGTCTTT. AATGGAAGCG ATGATAAAGG AAATAGATCT ATTTTAAAAC 
67501 AAAACCAAAA AGCTATTTCT GTCTAGATTA AGAAGTGGCC CCAGGTGTGG 
67551 GATTCACATT TTGAGTGGCT ACTCGGGCAA CCTCAATGAT CCAGGGTTGG 
67601 TCTATGATTC CAGCTGCATG GCTGCCCTTC TCTGCAATCC CAATAATCCA 
67651 TGCTTGGTGG CCTCCACCAC ATTTGGGGGA GTTTATCTCT GCACAGAACC 
67701 AAGCTGCTTG CTCATGTGGT AAACAGATCA GAATGCCTCC TGCTGTCTCT 
67751 GGACATGTTC CCTGCATGAG GCTGAACATG TTTCCACAGG CCTTGCTCAT 
67801 CACAGTCATC TTGGCCAGCA CAGGGACCGT TGTGAATTGC AAAAAACCAT 
67851 TCCTCAGCTG CTTGGTCAGG TTCTGTGTAT GACCCCAAAT CCTAATGCCC 
67901 ACGATGTCCG TGGCTGCATG GGCACTGAAC GTGTGCATGA TGAAGCTTTC 
67951 CTGTTGAACT TCACCATGTT CATTATTGCT TCTTGGTATG TCAACTCTAC 
68001 ATCTTCGTGG GTGACCACTA GTTTATTTCA TTTTTCAGGA ATATCCAGCC 
68051 ACTGCTATTT GATACAAGTG ACAGTCGTGC ACAGCCACTC CACTTGTGCC 
68101 CCCAGGGGTT TTGTGGACTC CAGCACATCC CCTGGCGCTG TATTGTCTGG 
68151 CATGATAAAT TGATTGGCCT GTAGTTGTAG TGACTCCTCC CAAAACAATC 
68201 CAGGAGGTTA GCACTGTTTG GCCACAAGTT ACAGACATTC CTGCTTCCTC 
68251 GGCTGCATCT TTAAAACCCT GCATAATCAG GGGCATCACT TTATCCCTTT 
68301 CCCTGTCAGT TGTTTCATTA TTGATGAAAG AAAAAATGTA TTCAGATTGA 
68351 TTTCTTTAGT TCAGAAATTG TTTCTTTTAT TGTATCTTTG GACATTTTTT 
68401 TCTTTTATTG TTCTGTTTCC TTTTACTTCA AAAACACCTA TTATGTATAT 
68451 ACTGGATCCC ATGTTCTACA TATATTATCC TTCTTAAACT TATGAATTCT 
68501' TTTTCCAATT TCATGTTTCT CATTTTAGTT GTCCAAAGCC TGTTTCAAAT 
68551 GTCCCTTGGC TGTTTTCAGT GGGTTCAATT CTACTTCATA CTGACTACAA 
68601 ATTGTTTTCA TTTCCATGAT TTTTTATTAT ACCTTCCATT TATTTACTGA 
68651 TATCTGCCAG CTTCGTATTC ATTTCCTTAC TACACCTTAT ATTTCTTTTT 
68701 TTGATTGGTT TATCTTTCAT TTATATTCTG ATTTCAATGA GGTTATACTT 
68751 TCTATAATTT ATAGAATGTC ATAAAAATCT TTTAAATGTT CCCCTGCTTC 
68801 CTGGTATGAC CTTGAAGAGC AGTCTTTATA TGCCTTCTGT TTGTTATTAT 
68851 TATTTTCATT CTTTTTACTT GTTGTCTATA CTGAACCCTA ATGGATTATA 
68901 AATTATATTT GTGGTGCTCA TTTTGGGGCT AGCTGTTGGA GCAGATATGA 
68951 TACAGGCTGG GGAAATGAAC CAGCAGAAAG GCAGGTTCAT CCAGGCTTGT 
69001 TGTTTCTAAG CACCTCACCA ACTCTGCCCC AGGGTGTATG TGCTCTGTGC 
69051 TCTCAGAGTG AGGCTAGCAG GGCCCTTGAG GAGTCAGGAA TCCCTGTGCC 
69101 AAATTGTTCA GTCCTGTATG TCACAAGTCT CTCTCTCTTT CTTTTCTTTC 
69151 TCTTTCTCTC CCTTCAGTAT TTTATGTGTT TATTTTCCTT CAGCTGATCG 
69201 TAAGAGGCAG CTGCAAGGCA GAAAACCTAC CCAGGCCACC CTAGAATCTG 
69251 GCCCCTTGAG CAGATAGATG CCTGCAGATT TCCTCATTTG TTCCTCTTGC 
69301 CTAAACCTTA CAGGGGTGAG AATTCAGTGC TGGGTCAATA GCTCCTCAAA 
69351 CCCTTGACCT TGCTCTCCTA AATAAGGGAA TTGTACTTTG GACTCTTCAA 
69401 GTCCTTGCTA CATAATGTTA AAGGTTTTCT GTCATGCGAG AACATAAAGG 
69451 TACATGTTAT CATGTTTTAT CCTTGTGTCT TCCATAAACT GTATACATTT 
69501 AACATATGGA TTGTGGATAT TTTCTAACTT TAATGAAGAT GGAACTCTTT 
69551 TTCTCTCATC GTGGTGTTTT CACTTGGTGG TATATGTTCT TTATTTCACC 
69601 ATTTCTAACC ATATGTCTTT CTAAATTTTA AGATATGTTT TTCTAAGTCA 
69651 AGTGGGGTAG CATGTGCCTG TAGTCCCAGA TGCTTGGCAG GCTGAGGCAG 
69701 GAGGATCATT TGAGCCCAGG AGTTTGAGTC CAGAAACATA GTGAGAl'CCC 
69751 TCTCTTAATG AACAGCAGAA AAGAGAAGAT ATATAGGTTT TCCTAATTTT 
69801 AGCAGTAACA TACTCTATAT AAGAAACTTG GAAAAAAACA TTAAATCATA 
69851 CAGCTAAATA AGTAAATACA TGAACAAACA AATAAATAAC ATGACCCATT 
69901 ACCTACCACC TAGATATATC AGTGCTTATC AGCTAACTTT TATTTCTTCA 
69951 GTGAATTTCC ATGGCCTTCA AAGTCGTCAA AGCCCTATAG ACAGGAAACC 
70001 CATTCAATTT TCTTTAATCA AATATCTAAT TGTTTACCTG ATGACAATAA 
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70051 AAACACAAAA TACAACCAAT TTCAAAAGTT GTTAGTTAAG TGATAAACCA 
.70101 AGAAATATTG ATTAAGACTA TTAGTTGGTG ATATAATTAT TATTAATAAA 
70151 AACAACTGAA AAAGGTAAAG GTAGTTGCCT CATGTATGAG GAAGTAGGGA 
70201 AGAGGAAGAG GTAATGTGAG ATTTGCTTTT CAACTTAAGC TTTTATTATT 
70251 TGATATTTTC TAACTATGTA AATGGGTTAC TTTGAGATAA AAATCATTCA 
70301 TTAAAAATAA ATCAAAATAA ATTTATTTTG ATTACATGTT TTTGATTACA 
70351 TGTTACTGAA AATGTGTTTT TTGTGTAAAA TTTTTACATT TTGTTAAGTG 
70401 ATTCTCTTAT CACAGTCTTT AATTTTCAAC CATTATGTAA TAGGAATTAA 
70451 ATAAGGAATA ATACTTAGAT CACTGTTTTG GTTATGTATG TAAACATCAC 
70501 ATTTCTTGAC TACTGACACA CAATGCCTGT TTCCCTAATG CTTTCTGATT 
70551 CTTTGAGTTC TACATCACAT GACTTACGTC TTATGTGTTT GGTTATATGA 
70601 GCAAAAATAC ACAAATTTGT CATGAGATCA TATATATTTC ATTGTCTCTT 
70651 TCTTATAGAG TCTGTTAGGG GAACACGGAA AGCATCCTCA TGCCCCCTGG 
70701 GTGTTCTTGT TTCTGGATTT GTAACTCGGC TTTCAGAGTC TTCCTACTAT 
70751 TTCTCTCAGC AGCTATAAGA AGCAAGCTCT TGTCCTGCTC CTTGTTACCC 
70801 AGTTGGAAGA CTCCTGACTT AGGCAGGGAG AAAAAAGAGG CAAGTGACAC 
70851 ACTTTCCAGG TCCAGATGTC AGATGAGCTT TTGAATCTCC TTGGGCCAAT 
70901 TTTTTCTCAT CCACAAATCC AGGGTGTCAG ACTGAGTAAC TGGGAAGGTT 
70951 TATTTGGAAA CTAAAATGTT CAGATTCCAC AAAAGGGTGG GATAATATCA 
71001 TTTGTGTCAC ATAATCCTTA TCTAGGCAGT GATGTCAGAT GATTGCAGGC 
71051 CTCTATCTCA TACCTCTGGT TGCCAAACAC CTATAGGAGT TCACACAGAT 
71101 GAGCCCCTAT GGAGCACTTG AAATGAGGGG GGCAAGCGGA CCTGGGTTTC 
71151 ACTGAAGAGG GAAAATGCAG AAGCAAATGA TTTGGCTACT ATCACATTTT 
71201 GAAATATAAA GATGTGAGAG TACGTGGGTG TTGCTGGAGT TGGAAAAATA 
71251 GATGAGTTTT AGCTCACAAA TCCAATCTCT TTAAATACCT GGAGAGACTC 
71301 CAAGCTTCCT GCAGAAAACC AAACCAAAAC AAAACAAAAC CACTTTCTTA 
71351 ACCTCCCCTT CATTTCACGA GCATCATAAT GAGAGCAAGG TATGCATAAT 
71401 GAGGAGCTCT GCGAGGTCAG ATCAATCATA AAAGCAGGAG CAATTTAATC 
71451 CCCTAGACCC TGCAAATAAC TGTGGCAGGA GGCTGAGGCC ATCTTTCCAC 
71501 AGGTTTAGAG AATTCATTTA TTCTTTTGAT CCGAATTTTA AGTTCAAAGG 
71551 ATTCCAAAAG CAAAATAAAA CAAAAGGAAA AAAAAAAACC CTTGGGATTT 
71601 GCATTACATT TGTTTTGTAT GGTTTATTGA TTTTATTTTG TTCTCTCACA 
71651 CACCATTTGA GGCAGGCACA AGAAGGATCA TTTCCCCCAT TTCATAGCCG 
71701 GGGTGGCTGA GGCACAGAAA GATTGCGTGA CTTGTTCAAT ATCACATAGT 
71751 GATTTTTCAA AGGTAGGACT TAAACCTGAG GTTTCTOACT CTTTGGCAAT 
71801 TTCTCCTGCC TGCTCACTTT GCTGCTCCTC AAAGTCTTTC ACACAGGTGG 
71851 CTCAGGAGGT AACATATACT AAGTACCCTG TGTGGGGCCT AGCAGAGAGT 
71901 AGATGCTTAA GAAATGGTAG CTACTCCCTT AGTTTTAGCC ACTCCATGTT 
71951 AGGCCCTACA TTCTGTAGTG CAGGACCTTG TTTGGTAAAG TTTTTCTTAA 
72001 TGATCCCTGC CATGAGGAAT GAAAAATTAG ACTTACCAAG CACTGTATTT 
72051 GGAAAACATG TGGGTTATGA AGTCTGCAAA ATTTGCACCC AGGGCTAGCA 
72101 CAGCTAACTC ATTTTTTGCC AGGCTTTTGC TCTTTCTTTC TATTTCAGTA 
72151 AAACTTCCGT CAAGTATATT CAAACAAGAT ATGACTACTG GCATTAACAT 
72201 TAGTGAAATG TGTTAAAATT TGTCTTCCTA AGTAAATGAG AGAACTCTAG 
72251 TCATGATTCT GCATTGCTGG AATTTGCAGA TAGTTTAAAA TCACCTTCCA 
72301 GTCATCTATG CTCTATATGC AGGGCACATG TTTGTTTTAT TTTCTCTTTT 
72351 TTTCCCTCAA TTATTTTATG TTGGCTTTGA GGCTCAGGAG TCATAACACT 
72401 ATTGCGTGAT TAATTTTGTG CCTCTATTAT GATGGAGTTT TTGTGAAATT 
72451 AACTGTGCTG TTTCCTTCCC AGTAATGTCC TCTGATGAAT ATTTTATTTC 
72501 TCTCAATTGC TGTGTGGTCT GCCTTGCTTG ATATTTTGAG GTACTTTTAC 
72551 AAATAACAAG AGCCTCTGCC CCTCAGAGTA CTGTGCCTTT GTAGGCAATC 
72601 CAGAAGTCAG CCAAGAAAGC TCTTCCCTTT AAGAGCACAG AACATGTCCA 
72651 ATTTTTGTGC CCACAGGAAA GCAACAAGAT TTTCCTGACC TCCCAAAAGC 
72701 CATCAATTAG AAATTACAAC AGCATCAGCA CAGAAAAGGG GACATAGTTG 
72751 CATCTCGCCA TCATGGAGAT GTGTTACCAG GCCAGTCGAG TCCCATGTTT 
72801 ATCTGAGACT ACAGAGAATT CCTTGGGTTT CCTATTTATT CTCCCCTGAG 
72851 CACAGTGATG GCTCTTGAAT ATTCGATACA GACTCCCTCC TGGCAGCTCC 
72901 AAGCCTTATG CAGCTGCCTG TTTTTCTAAG GAGGGCATAA GTTCAGGAAC 
72951 AAGAAAAAAA ACATGCTGAA GGAGTGACAG GTGCAGGTGC AAGTCAAGAG 
73001 AAAGGGAAAA AATGGAGCAG CTACTGATTG CCAGACACTG TGCTCGGTGC 
73051 TTTAAAAATA TTACCTCCTT AATTGTCACA ATCACCTGGT AATGGGGTTA 
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73101 TGATCTCCAA TCTACATATA AGACCTCTGG AGCTCAGAGA GGTTAAGTAA 
73151 CTTGTCTAAG GAGCCTGGTG TCCAGCATTC AAATATTTGT TAAGTCAATC 
73201 ATGAATGAAT AAATGTTATA CAGCCAATGC AAGGAGTGAA TGAAGAGAGC 
73251 CCCTTTTAAC TCTGTTTTAT AGGAATAAAT ATTCTCTGTG GCAGTGGGAA 
73301 GACCTAGCAG ACTGGGCATG GGAGCTTATG AAGCAAAGTA GTTTGAATGA 
73351 ATTGCTGGCA AGATAAGGAC AGTGATGTGT CTCTGTTTTC CCAGTGTGCA 
73401 CTCTGTAGTC TGTAGTTTTT GTACTGTAGT CTTTGTACGA AGTCCTGAGT 
73451 GATGGCTCTG GCCATTCCAT GTCCCACTCC ATCCTTGATT TGCCAGAGTC 
73501 ATCAAGAGGT CCCCAGCGTT AGCTGAGCTA CTTGATGTGA TCTCGTTATA 
73551 CATAATCCTG CGTATGGCAA CTAAGAACTG GAAACAACCA TCAAAGTCCT 
73601 AAAAATTTTC TGCCTTTTGG GGCACTTGGA AGTAAGGTAA AGAATGAAGC 
73651 ACTTTGGGGT TAGACAACAA ACATGATTTT CACATGTAAG TGACATGGGC 
73701 TGTGCCCTGC AATCCTGCTC ATGAACTTAT GTCTTTCTTC AGGCCCTGTT 
73751 TCTACTAAAT CTTGTGATTT TCCAGATTTC ACTGAGAGCA GATTTTATAC 
73801 CCCCTTTGAA ATAATTTGGT TTCTGCATTC AAAACTTCTG TTATTGTGAT 
73851 GGTCTGGAGG GCAAGCTGCA GAAAATGCTT TCTTCCCCAT GAATTAAACG 
73901 CTCTTAGGCA AAAGGAAAGA GGGTGTGAGA TTGTCGTGCC TGCCTGAAAG 
73951 CAGGCAGGTA TCTCCAGCAT CCATATCAAA GCCTTCAGTG CTCATGGTGA 
74001 CATTAATAAT AAATGACGAA TTTCTTGCAG GCTTTCAATT CAGCAACATT 
74051 GCTTTTCCTG TGGTTAGGTC AGTTCCAGTA TGTTTGGCTC TAGTATTTCA 
74101 GAAGTCTCCT GACAAACAAA TGCAAAAAGA ATTAGCAGCG TGCAATGATT 
74151 TTGCTTTGCT CACCATAAAC AGCCTTTTAG CTTAAAAAGA AATAAAGGTA 
74201 CAGTTTTTCA AGTGAATCTT GGTTATTAAA GCTTCTCTGA GGATTACTAA 
74251 TCTAAGTAAC TGCTAGCGTG GTCTTGAGTT CTCTTTTCTG ATATTGATGG 
74301 GCCAATTAGA TGGAACTAGG TGACAGAAAT TGGGCTAGTT ACCCAAGGTG 
74351 GTACTAGAAA CAGTGTTGGT GGGCTTCACC TCTCTGCCAT AGCAGAGGGC 
74401 AGTTGGAAAA CAAAACAAAA CTTCCACTAA TTTCTCTCCA TCCCCACTGG 
74451 ACCTGCCAAA CCACCACTCT TAACTCTCAA ATGACCACGC AGAAATCTCA 
74501 CTGCTCTTTA TGCTTCCCCT TTCTGCTCCA ATTTAGAAAG CCATGGTGCA 
74551 CTTTTGAAAC AAATGCTATC TTAAGCAATT GATTCCGTGG TTGAAACCTT 
74601 CCAATGACTT TCTATTGCAC TTTGTATAAA ATCCAAAATC CTACCATGGC 
74651 TCCTAAGGGG CCCAGCAGTG TCCCTGGTCT CATGTCTGTG GCTCTGTTGC 
74701 TCCATTACTT ACTGTTATCT AGCCATGCCG GACTTCCAGA AAACCCTGGG 
74751 ACTCAGCGGG CTCTTCCCTC ACTGCTGCAG CCTCTGCACA TGCTATCTCA 
74801 CACCTCTGCC TTCACTGTCT TCCTCTGTCT CTGTCTCACC TATATTTTCC 
74851 CTTCAGGTCT C AGTTTAAAG TTGC ATCCCC AGAAAGTCTC CCTGATTAAA 
74901 ATAAAAATAT ACATTCATAC TTAGGACTTA AGATGAGGAC TGTTTGCTAG 
74951 TGTATATTTT ACTTGAGATT TTTTTAAAGC CTGCCTTCCT TCTGGTTTAT 
75001 AAACTCAGTT GAGCAATAAT CAAGGCTGTT TTATATACCA CTATATCCTT 
75051 AGCACCTAAC AGAGTGCCTG ACACTTAAGA GGTGITATAA ATATTTATTG 
75101 AATGAATTAA AAAATGGCAT CTTGTGCCCT CTATCAATTT TATTTACCTT 
75151 TATTTTTAAA GCCACTGTAT GGGATATCCA GTTTCAGCCC TCAGGTAAAC 
75201 GGGCACAGCT TTGCTGGGTG TTCATACTGC TCGATGGGCT TGCTGGCTGC 
75251 IKSCCCCACAG CTGAGCATGT ATTTCGTGTC CTAACCCAGG TTTGGCCCTT 
75301 GCAACCTGAC CTCAGACCCA TTGTTCTAAA TTGGATCTCT GATTGCCCTG 
75351 CCTAGTAGTC TGAGCCTGTG CTCAAAGCCT GTTTTAATGA AAGCTTTCAA 
75401 ATCTTCAGTT TTAAAAATCT GTAGTATCCT GATAGGTCTC TCTGTAGAAT 
75451 AGCTGTCACT CTTTATATAT AGCTCTAAGA GAGTCAGACC AAGCCTAATT 
75501 CAACTTCTTG TCTTGGCTTG TATAACTTTG AAGGCTAAGT GGAAATATTT 
75551 ATAAATTAAT CAGCATTGAA AATATAGTGG TTGTTGGAAT TTACAAGAAG 
75601 TCAGGCCTCT GGAATTTCCT GAAGCCATAG AACTTAAGAA TGTAAAATAA 
75651 CATACTTTGG GAATAAAAGT ATAAATAGAA GTCTGATGGA TATACCAACT 
75701 GTGCCTTCAT CTAACATTTC CCTTACTGCT ACTGTCATGT GAAGTCAAGA 
75751 GTAAGCCCAT TGTTTTA.TAG AGATATCATT CCAAATTCTG TGAAGACTTC 
75801 TCATATATTT CATTATACCA TGTTTTATTT TATTTTGATA AGAACACAGC 
75851 ATGAGATCTA TCCTCAACAA ATATATAAGT TGAGTTGACT ATAGGTATAA 
75901 TGTTGTACAG CAGATGTCTA GGGCTTATTC ATGTTGTTTC AGACTTTTTA 
75951 TCCATTGATT AGTGACTCTC CATTTTCTTC TATTCCTAGG CCTTTGTCAC 
76001 CAATAGTCTA CTCTTTGATT CTATGAATTT GACCATTTGA GACACTTCAT 
76051 GTAAGTGAAA TCATGCACGC TTGGTATTCT TGTGACTGGC TTATTTTACT 
76101 TAGCGTAATA TCCTCAAGGT TCATCCATAT TGTTGCACAT AGCAGAATTT 



WO 02/086113^ A PCT/GB02/01887 

26/219 

76151 CCTTCCTTTT CTAAGGTTGC ATAGTGTTCC ATTGTATGTA TATACAACAT 
7 6201 TTTCTTCATC CATTCATTCA TCAATGTTTT CATATCTTGC TTATTGTAAA 
76251 TTGTGTTACA GTGAATATGG GACTGTCAAT ATCTTCTCAA GATTCCTATT 
76301 GCAATTATTC TGTAGAAATA CCCAGAGGTG GGATTTGTGA AAATTTAAAA 
76351 CATCTTACCC TTGACTCTCC AGACTTCTTA CTTGACGCTG ACTCTCTAGA 
76401 GGAGTGCAAC TGCATTTATT CCCGAAGTGT CTCACAAATA CTGTAGCATG 
76451 TTTAATACCG ATACCCATCA AATATACTGA TTCTGTGGCA AAGAATTCAT 
76501 CAAAATCTCA ATGCATCATA AGGAAAGGGA CAATGATACA GAACATCTAA 
7 6551 GAATTCTACT TCAATATCTT CTCCTTTAAA ACAACAAAAA AACCAATCCT 
76601 CAACCTCTCA TTAATAAATG CTATGTCCCT GATAGGAAAA AAGACTTTCG 
76651 GGGCTGGGTA CGATGGCTCA CGCCTGTAAT CCCAGCACTT TGGGAGGCCG 
7 6701 AGGCAGGCGA ATCACTTGAG GTCACGAGTT TGAGACTAAC CTGGCCAACA 
7 6751 TGGTTGAAAC CCCATCTCTA CTAAAAATAC AAAAATTAGC CGGGCGTGGT 
76801 GGCATACGCC TGTAGTACCA GCTACTCGAG AGGCTGGGGT GGGAGAATCA 
76851 CTTGAATCTG GGAGGTGGAG GTTGCAGTGA GCCAAGATTG AGCCATTGCA 
76901 CTTCAGCCTG GGCTACAGAG CAAGACTCTG TCTCAAAAAA AAAAAAAAAA 
76951 AAAAAAAGTC AAAAGACTCT CTGGCTCTCC TGATGGCATT TTTTTTAATC 
77001 ACTGTAAACA AAATATCAGT GAAAGGTCGA GTCTGTCTAT GCCTTCCCTC 
77051 TTACCAATTT ACAAAATAAA CACTTACTGA TTTGATCGTT AGAGTTTAAA 
77101 CTATTAAACT GAAGACATAA GCTCTTTGAA CAAATATTGA AGGGAAGAGA 
77151 CTGATTAGAA ATTCTCATGA GGCCAAAATT TGATACACTG ATGATTACAG 
77201 CTTCTCTTAC GTGTCAGGGT CTTTGGAAGA AAATATGGCT CTAGATTTTC 
77251 TGAAGATGCA TCCAGCGTAC ACCAGCCAGC AGACTTTGGG CAATCGTTTC 
77301 CTTGAGTGTT CTTCCAGTTT ATTAAACAGT AAATCAGTTT TGAGTTTTGC 
77351 TTTGTAAACT GAACTTTCCA AACAAGGGTA TTGAAGTGAT GCATAACTGA 
77401 AGGCCTAAGT GTCCGTTCAT TTTTGGGATT CAATAAGCAT TATTTTGAAA 
77451 ACAAACCAAC TTAATCAATG TTAGCCCTAC CCAAAATGAG TTTACACTAA 
77501 AATCAAATTC TTCTTTTTTT AAATCCACCT GAAAAAAATC AGTGCAGAGA 
77551 ATTATTAGGA AAAAATTAGT GCAGAGAATG AGTTTTTTTT TTTTTTTTTT 
77601 TTTTTTCCCC ACTGGGGTCA AGCTCTGAAT GCATTGAGAA TCTCAGGAAA 
77651 GTGTAAGATG TAGTCTGGGC GGTGATTTCC TTCGGGGCTG AGGGTCATAC 
77701 . TAAAAAGCAG GCTTGGTTGG AAAGGTTGAT TGCATACAGT GTGATTGTAA 
77751 GCTTTGTGCC TCACCCTCAC TGGTTACAAT GAAACCCCAC TCAATGTTAA 
77801 CACATTTTCA AACTCTCTTT GCATCTGACA TTTCACCATA AAAGCCCATT 
77851 ACTTTGGGAA AGTCATTTCT TGGTAGTCAT TCTGGTTGAA ATCGCCTCCC 
77901 TCAAAAAATC ACAAACAGGA TATGTAGGGA CTGGTGAGAC TAAGGTACAG 
77951 AAACTGGAAT AGGAGAAAGG GAGACAAAAA ACTAGAGCTG TATCAAGCCT 
78001 ATTTGGGCTA AGAGAGTGGG AAGGAGCAAA GGACTATGCC AGTGAGTCAA 
78051 AGGACCGTGA GGGGTGGGCT GGAAGATGAG AGTGCACCAT CCACAAGTCC 
78101 TGCCTCTTTC CTGGCTTCAT TTATACCAAA AACATCTATC TGGGTAAGGC 
78151 AGAAAATTAA TGACACCTTT GGTCCCCTAA ATACGCTCTA GACAACCGAC 
78201 TATGTCAGAA GAAAACACAA AGAAGAGAGG TTACCTAAAT GAAGGGTAAC 
78251 TGAATTTTTC ATTTCGTATA GAGCTTTATA CACAATTTTA CCAAACATTT 
78301 AATGTGAAAT TTACTGTGTA AAATAGCAGT TTCTATTGAG GCCAAGGCAG 
78351 ACATTTTCTG GCCACATGTG TATAACTTTC AAAGGACCGT GAGAGACACA 
78401 GAGAGAAGAG GGAAGAGACA CAGTTAGACA GTGAGAGCAA AGGCCTACCT 
78451 GTAGCTTTCT CTGAACCAGA CAGAGAGTGG AGAATTTGGA AAGAAAATGC 
78501 AGAGCCAGGA GTAAAGACAA AAAGTGAAGT TTTATGAGTT TGCCTCTTTC 
78551 TCTTGCTCTT CTTCCTTTTA CTTCTGTGAA TATTTGCAGA AAATTACAGT 
78601 CCTGTTCTAC GTTTTGAGAG AGGATGTAGA TCTCCCAACT CTGAACTTGT 
7 8651 CACAGCCGCC ATTTCCCACT GCCAACCACC TATGGATTTC CTATAAGCAG 
78701 AGTTCATTCA TTGTGAAAAT TAGATAATTG AAACAATCTA TCTGCTTCAG 
78751 GCTGCTGGCT TCAGGGGTGA CACTCACCCG TTGCCATGAC AAATAATTTT 
78801 CAGAGTGAGA TTTTAGCCAC CATAACTCAC TGAACAATAT GTTGTTGCCC 
78851 TGTTCTGTTT ATCAGGAAGA GAGGATTTTT TTTTTTCCAA AATGAAATTA 
78901 ATTAATTATT TTTATTGGCT CCTGGAGATG ATAAAGCAAC CTCCTGCCAA 
78951 TTCATCTTCC AGCCAGCCTC CTCCCTTTTG CACTATGATC CTCTCCCTTG 
79001 CTCCAAAAGA AGACATGATA AAAGAGTGCA AGGCTGTCGA TGGTGCTTGT 
79051 CTGGGTTCAT TCAGTCACGC CTTCCGGGTG GCAGAATATT GATTATGGTG 
79101 TGAATCCTGA AGGGATTAAA TCAGATCAGG AGAGAAATCC ATACCTCCTG 
79151 TTGCCTCACA AATTTGCATG CAGCACCCTC ATCCAGGGGT TAATTACATT 
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79201 TCTGTATCTT TCTAATCAGA AAAGTCTTCA AAGGAGATGG GCACAGGCTT 
79251 GGAATAATCT TCCTCTGAAT TTCATAGGAT TAAAAGCAGT CAGCAAAGCT 
793 01 AGGGCTGTGA AGCCGAGACC TCAGCTTCTG TTGATCTCTA CCTAGTCTCT 
79351 ATGTCTGGAG GGTGTGTGGC CCATCAGGTG CACTCAAGAA ATAATTGCTG 
79401 AACAAATAAA TGAATGCCTG AGTGAAATGA CTGTCTCACT TCACTTTAGA 
79451 CACGTTGAAT TCCAGTGTCC TCCCCTTTTT CCTTTTCATC AAAATCTGCT 
79501 CTTACAATTC TGTTTCCATT TATGCATAAG TGCTTGAAAG CTTAAAGAAG 
79551 CATTTCCCAA AGCGTGAGCA CAAGCACTTC TAACCTGTTC CACGGAAAGA 
79601 AGGTTGTATG TTCACATGAG TTTGGGAAAT TGTGCATGCT AGATTCTATT 
79651 CTTGCTGATT GAAATACTTA CAGGAATACT AAAGGCTCTT CCTCTTGCAG 
79701 ATAAAAAAAA TCTGGCTGAC TCTGAGGAAC TCAAAATTTC CTAAACTTTA 
79751 TTAATTCACT TCACTTAATT TTATTCAATA AACATTAATG AAACATCTAC 
79801 AATATGTCAG GAAATGGTCT AGGTACAGAG AGGGAACAGT CAGATAAGTC 
79851 TCTGCCCTGT GGAATTTACA TGCTAGTGGG AAAGACAGTG CACAGTAAAT 
79901 AAATAAAAAT CTGATGTATC AAATCATGAT TAGTACAATG GAGAAACACA 
79951 GAGAAGGATA GGAACGTTCT GATGGGGAAT CACCCTTTAA GAAAATTTTT 
80001 GAGATAGGGT CTCACTCTGT CACGAGATTG GAGTGCAGTG GTGCAATCAT 
80051 GGCTCACTGC AACCTCAAAC CCCTGGGCTC AAGCAACCCT CCCACTTCAG 
80101 CCTCCCAAAT GGCTGGGACT ATGGATAAGT GCCACCACAC CTAGCAGGAG 
80151 TTGCCATTTT AAGTTGAGTG CTTAGTGGTT ACTCTAAGGT GATGTTTGAG 
80201 CAAAGACCTG AAGGAGGTGA GCAAAGGATC CCCTAGGGGA AAAGACATTC 
80251 AGGTGAGGGA AGCAGCAAGT GCAAAGGCCC TGAGGAAGGG ACATTTCTGG 
80301 AGCATGTGAG GAAACCAAGA TGGCCAGCAC AGTTGGTAGA TCCAGGATAG 
80351 GGAGAAATAT AGGAGAAAGG TCAGTGACGT GACAGTGG^G GGGGCTAGGT 
80401 TGCATAAACT GTATTTAACT ACAAAAGTGT TTTCCTTTTT CTTCAAGTAT 
80451 AATCTCTTAA AATGCTGACT GAGAAGTGCT GCCCTTGCTC TGAATCCTCT 
80501 TTGTCCTCCT GCCCAGTAAA TCACACAATT AAGAAAAGAA AGAAAAAAAA 
80551 CATAAGCAAG AACCTATTTC CTCATCTGTA AAAAGGGGAA ATATAATATT 
80601 ATCTTCCTCT CAAAGTTATT TGAAGGATGA AATAAAATAA ATTTTATATG 
80651 ATGACAGATA TGTGGTTGCA GCTCAATAAA TGTTAGATCT CTGCATCCTC 
80701 TCTCCTTCTC ACTTCATCTC CCACAAACCT CATTGTCAAG CTGCTGTCAT 
80751 CTATAAGCAC CAAGTGCAGA ATCAGGAAAG TCATAGCAGG CCGGGTGTGG 
80801 TGGTTCACGC CTGTAATCTC AGCATTTTGG GTGACTGAGG CGAGCGGATG 
80851 TCAGGGGTTT GGATCACGAG GTCATTATCA CGAGGTCAGG AGTTTTACCA 
80901 ACGTGGTGAA ACCCTGTCTG TACTAAAAAT ACAAAAAAAT TAGCCAGGCA 
80951 TGGTGGCACG CACCTGTAAT CCCAGATACT CAGGAGGCTG AGGCAGGAGA 
81001 ATCACTTCAA CCCGGGAGGC GGAGGTGGCA GGGAGCCGAG ATTGCGCCAC 
81051 TGCACTCCAG CCTGGCAACA GAGCGAGATT CTGTCTCAAA AAAAAAGTCA 
81101 TATCAGAGCA ACCACTGAAG GTGGACAGAA ACAAACCAAA ACTGTAGTGC 
81151 ACAATATGCA GTAGATTTCA. CTAATTAAAG ATGAGGAGAA AAAAGTTCCA 
81201 ATGCCAAAAG AATGAAGAAA AGAGTAGATG AACCAGCAGG GATCAGGCCT 
81251 GTGGGCTGTG CACGCAGCAC GGTGTTTCAG AACAAGATGC AGGGGAGCCA 
81301 CAGGGCACAG AGAGTGAGGA CGACGCCTGG CATTTCCAGA GAGACTTCTT 
81351 GGAATGGTGA AGGTCAGAGG TGTCTGAGGG ATTCATGGGA AGAAGACTGG 
81401 GCATTTTTGA CATCTCATGT TGAACAGTAT TAAATGCCCT GGCTGAAAAT 
81451 GTGGGTATAT ATAGATTCTG ACAGCAAAAG ACTGGCAACC TCAGGTAAAA 
81501 TTAGAGTTTC TGTTTACTCA TATCATAATG AATACATTAT AAGACTCTGT 
81551 CAATCTGATA GTTATGAAAA TGTACCTCAT TGATATTTTA TGCGATATCT 
81601 CCCCAATTAA TGTCATTGAG CATCTTGTCA TGTTTGTTGG AAATTGGTGT 
81651 TTAATTCATT GACATTTATC GAGGTTACTT ATATGCCTGC TCTTATTTCT 
81701 TTCATGTAGT GCTTCCATTT TATTTCACTC TTGCTCTATT GATAATTATT 
81751 TTTAGTGTTG CCAGTCTTAT GGTATTACAA GCAATATTTC AATGAATAAT 
81801/ CTCCCATCCA TGTCATCACG TCCATGTTTG GGAATATTTG TTAGGTAAAT 
81851 CCCTAGAAGC AAATAACTGG AGCCAAAGTA TGTGTATTTA TAATATTTTT 
81901 TAGGAAATGT ATTTTATTCT GTAGTAGGCA ATAGAAGAAC ATATTATATA 
81951 TTTTTAAAAG GATCAAAGGG TTAAATTGTG AAAATTAAGT CTTAAGTCAA 
82001 CCTCTTCTCA CCTAGCCACC CGTAAATATT GCTACCAGCT TGTAAGAAAT 
82051 CCCTCAGAAA TATTCCATGC ATATATAAGC AACTGTGTAC ACTCAAGAAT 
82101 TTATTTCATC TTGGTGGGTT TGTGTTTTCT TCTGCATTGT- AGTTTATAGC 
82151 AAAGCTATAT AGCTCTTGCC AGGACACTAA ATAAAGCAGA GACACTAGGC 
82201 TTTGATGTGA ATGTTTCTGA TAGGAACATA ATTCCTTGTT ATCTTGTCAT 
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82251 TTAGTATTAA CCATTCACTG TCATCATGAT GCATGTCACC AATAAAGTGA 
82301 TTGTTTCTTT AATTTCAGTG TGCCTAACAC TTTACTCTGG AGGAAAAGCT 
82351 GGCTGTCAAG ATGAGGCCGT GGCCACACAT TTTTTAACTT GCTTCAATCT 
82401 GGATTGCTTA CTTGCTGTAT TTGGTACACT AAGTTATTCT GGGATCCTTT 
82451 GTTGCCACCA TTTTGGGAAT TTTCTGTGCT TCTTACTTTC CTTCCTCATT 
82501 TTCATGAAAA CTTCTTGTAG TTTTCAGAGA AAGGTGTGTA TAGGAGATAA 
82551 ACAACTTGAG ATCTTCTTAG TTACAACATA GTTTCAATTT ATTTTTATTC 
82601 TTGATTGATA GTTTAGCTGG GTATTAAATT CTACCCTGGT GACTGTTTTT 

82 651 CTTCAGACTT TTGAAAGCAT TAATACAGCT TCCAGTATCA CTTTTGAGAA 
82701 CTCAGAAGAC ACTTGTCTTT GTTTCCTTTT ATCTTGAAGT GTTTAAAATC 
82751 TTTCTAATAT CTCCAATAAG CTAAATTGTC ACAATGTTAT GCCTTGGTTT 
82801 ATTACTCATT TGCTGGGTTT TTGTGAGACT TTTATTTTCT TTTTTTCTTG 
82851 AGACAGAGTC TCACTCTGTT GCCAAGGCTG GAGTGCAGTG GCTTGATCTC 
82901 AGCTCACTGC AACTTCCGCC TCCTGGGTTC AAGTGATTCT CATGCCTCAG 
82951 CCTCTGGTGT AGCTAGGATT ACAGGTGGGC ACCATCACAC TCAGCTAATT 

83 001 TTTTCTATTT GCAGTAGAGG GTTTTGCCAT GTTGGTCAGG CTGGTCTTGA 
83051 ACTCCTGGCC TCAAGTGATC CGCCCACCTC AGCCTCCCAA AGTGCTGGAA 
83101 TTATAGGC AT GAGCCACTGC ACTGGACCTT GTGAGACTTT TCAATATGAA 
83151 AACTGTCTTT TAATTTGGGG AATTTTTTTG AACTAGTTTA TTCATGATTC 
83201 CTTCTTGTCT ATTTTCTTTA TTTTCTCTTT CAGGAATGGT TATAACTTAA 
83251 ATTTGTTTTT CTGGATTGAA TCTTCCGTTT TCTAATCTCT TGTTTCTTAG 
83301 TTTTATGTTA GTACTCTATT TTCTGGAAAT TTCTTCAAGT TTATATTTCA 
83351 TCTCTCTATA AGTTTTGATT TCTGCCACCA TACTTTTGAT TTCTAAGAGT 
83401 TGCTCTAATT TGTGTTCTGA ATTTTCTCTT TTAGAACATC TTATTCTTGC 
83451 TTCATTTGCT TTAGAATTAA CCATCACACT CTTTTTTATC CTCTTTATAG 
83501 AAAAAATGAT CTCCAGATTT CTACTTGGTT TGTGAGGGGC CATAATTGAA 
83551 TGACGTAGAG TTGGAGATGA TATTTAGAGA CATGATTACT TTTTGACTTG 
83601 ATTCTTCCTC ATTTTAAGTG CCCCCCTTTA CTGCTAACTT CATAAATATC 
83651 TGACGCTGTC AGTTCCTGTG ACTTTTGAGG ATTATCTGGT ATAAATTAGG 
83701 CTGATTCTTT ACTTTCTTCA CTGTAGACTC AGAACTTTGT TGTCTTTAAT 
83751 TAATTTTCTT TCCAGTTTCC AAAATATTTT TGCTTTTGCT AATTTTTTTA 
83801 TTCTCATTCT CTTTGTGGTT ACATATCTTT TTATCCTTTT GTTTTAAATT 
83851 TAGAGTTGTT TTAAGAAGGT GTTATGACTG AATGTTTGTC TTGCCTAAAT 
83901 TCACATGTTG AAGTTCTAAC CTCCAATGTG ATGGTATTTG GAGGTGGGAA 
83951 CGTTGGGAGG TAATTAGTTT TGGACAAGGT CATTAGGATG AGGTCCCAAT 
84001 GATAGAATTA TAAAGTCCTT ATTAAAAAAG GAAGAAAAAC CAGGCCTCAC 
84051 TGTTTTCATG TGAGCATACA GTGTAAAGAC TGCTATCTGT AAGCCAGGAA 
84101 CAGGTCCTCA CTACTTACTA AATTTCTAGC ACCTTGATCT TGGACTTCCC 
84151 AGTCTCATAG TCTACCCAGG ACTGTGATGA ATGAATGGCC ACCCAGTCTA 
84201 TGGAATTTTG TTATAGCAGC CTGAGCTGAC TAATACAAAG GATGTAAAAT 
84251 TAGATTTGTA TTTTTAATCC ATCATCTTAA CCTGGAACCT CCTTGACATC 
84301 TTGGTTTTTG GGGGGAGAGC GGGAATTGAG GAAAGTCCAG AGCATAAGAT 
84351 ACTGCTGAAA GGAGAACACT AAAATATGTT TAGGGTAGGT GCACCTGGGG 
84401 AGTCCAGGGG AAGCATGAAG ACCCTTGTTA TCTTTTTTTT TTTTTTTTTT 
84451 TTTTTTTTTT TTTTTTTTTT GAGATGGAGT CTCACTCTGT CGCCCAGGCT 
84501 GGAGTGCAGT GGCGCGATCT CGGCTCACTG CAAGCTCAGC GTCCCGGGTT 
84551 CACGCCATTC TCCTGCCTCA GCCTCCCGAG TAGCTGGGAC TACAGGTGCC 
84601 CGCCACCGCG CCCGGCTAAT TTTTTTTGTA TTTTTATTAG AGACGGGGTT 
84651 TCACCGTGTT AGTCAGGATG ATCTCGATCT CCTAACCTCC TGATCCGCCC 
84701 ACCTCAGCCT CCCAAAGTGC TGGGATGACA GGCGTGAGCC ACCGCGCCCG 
84751 GCCGACCCTT GTTATCTTTC AGGGATGAGA AAGAGCAACG GGACTCATTG 
84801 AGGGTAAGCA TCACCTGCTT CAGCCTTTGA TTTCTTAGCA TTCCAGAGCC 
84851 AAGGTATGGC TTCACGACAT TCAGAAAAGT GAGGGAGACA GGATGTTTTC 
84901 TGCTGTGGAG ATGCATCAAA GAAGACTCCA TTGCCAGTTT TAAGAGTGGC 
84951 TAAATTTAAA ACGGGGTGTA TTCTGTGTAG GCGTGTGCTG GTAATGAATG 
85001 TCTCAGCATG TAATTGCACA GTTCCTTGTG GCATATTTTT AAGTACTGTA 
85051 TTTGCAGATG ACCTTTTACA GAGGTAAATT GCATATAAGG GGCTGTAAGT 
85101 GCCACATTTA ATAATGCTTC AGGGACATTA GGAAGAAAGT AACAGTTATT 
85151 ATTCGCCAAT AGCCTGCCAT TATTAACTTA CAGGCAGGTT ACTGAGCAGT 
852 01 GACCAACACT CAGCAACAGA TGTCTCCTTC TCCAGCTTTA AGAACTGATC 
85251 ACAACTAGCC AAGGCATATG CAAGATGTTA AGGGAAAAAA GTAGAAAAAA 
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85301 TAAGATAGGA AAAGAAAAAT GTCTGGACTA GGAAAATCTA AATGACTCAC 
85351 CTTAAAAACG TATACTGATA AATGTTTCAC TTTTACAAAG GCACAACAAT 
85401 CTTTGAACTG GCCTGAGGTG TAGGGAATCC TAGGTCTGTA CCTGAGTTGG 
. 85451 TTCATTTCCA GTTTTCTCCA TTACACACAC TGAGGGTCAT TTTTCTGAAG 
85501 ATTTCCTGTT TAGCCCATCT CATATACAGT AACTAAATTA AAACATCACT 
85551 GTCATTACTA GTGAAAAACT TTGAAGTGGC TTTTTTTTTC TATCTAGAGA 
85601 ATTAAGTGAA AATTCTTTGA TCTCATCATC AAGACCCCAT AGTGTGAACT 
85651 TTGCAAGACG ACACCCCCGA CTCCCGACTC CTTACTTCTA GGGGTTTCTG 
85701 TTTAGGGCAG ATTGCTCTCC TTTTTGTCTT TTAATGTAAT ATGTTCATTC 
85751 CCAATTTCCT ACTTTTGTTG TAATATTCTG ACCTTCCTTC TGAATTATTC 
85801 TACCCCAGGC ATCAACAAAC TTCTGTAAAG GGTCAGATAA TAAGCATTTT 
85851 CAGCTTTGTG GGCTACATGC TCTCTGTTGC AACTATTAGA TTCTATTCTG 
. 85901 CTGTTGTATG AAAGCAGCAT AGATAACATG CAAACAAAGG GGTGGGGCCG 
85951 GATTAGATTT TTAGTTCTTG TAGTTTACTG ACCTCTGCTT TAGCCTCCTG 
86001 AATCTTGCTA TTCTTGCCAT GAAAATCCCA AACTCCATGA ACTCCATGAG 
86051 ACCATCCCAA CACATCCTCT TGTCAGGTCC CTGAATGTGC CAAGACTTCC 
86101 ACATCATTGC CTTGAGAGTT GCTGCCCAGT CTCAGTCTGG CTAGTGTGCT 
86151 TCTCTGACTT TCTTTTAAGG GGTTTCTTTT TCTCAGTTTC CTGGTCTTAT 
86201 GTCGGAAGCT GAACCACTAC TGGAACTGAG GGTTCCTTTG TTTGTTTCTC 
86251 TCTTGTTGTC TTTCCTATAG TATTTTCTTT ACCCTATCAT GATACTTAAA 
86301 TATATCACAC TAGATCAAAT CAAAGACAGT TTAAGAGCTT GGGTCACTAC 
86351 CATCCCAAGT CCATTATTAT TAGTTCTCAG TTCAGCATTC TTTCCTCACT 
86401 TACATCAGAC CCAGAGAGAC ACAGAAATAG AATCTCCTTG GGGTTGCCAA 
86451 AGATTGGCAC AGAGATCAAG TCAGAGATGG AGGTGGGGAA AAATTTAGGT 
86501 AAAGATTTCT CTCTTCCTTT CATCATCTTT CAATTAAACA CCACCAACAA 
86551 TAAAACAAAT ATGAAGTACA CAAGAAAATC CTCGCTAATC TATTTTTGAA 
86601 AGAATGCTTT TCTTCTATGC TTTCTCCTGC CACCTGAGAG ATAAAATGAT 
86651 TGAATTAATG AATCAGTATA GACCAATAGT AATCATGGAT GTGTTTAGTC 
86701 AGATCTGGCT TCTTTGCAAT TCTGGGTTTC TAGTAAAATA TTCAGGCTTA 
86751 TTTTCCCATT CTGAGTTCTC AAATTTTTAT GTGACATCTT GAGTATTTGG 
86801 ACTCAGGTAG TAACAAATGT TATGTTTGCT GGGTGAGATG TTCAATGTGA 
86851 GTAGTCATGG CCGGGAAAAC CTTACGGTTC CATTTTTCCT GTTAGCAAGC 
86901 TCTCCAGATA TTCTATGAGA ACTTACAGAA AGTAAATTTC AATAGAGACA 
86951 TCCACACCCA CGAGGTTTAA AAACATGAAG CATAAACATA TGAGAGTCAA 
87001 TATGATTTAT TCTTCACTTT AGCTATTAGA TAAAATGTGA ATATGAAATT 
87051 AGCAAATTAA ATTAGATTGA TATCTCATCT GGTTAGTAAC ATCATATTAA 
87101 TAGTTCAGTT AAGACCATTA ATTATTTTTA TACCCCATAT GACTAGTTCT 
87151 TGTTTATAAG ATGGCCTCTA AGGAAAGGAA ATGCAGAGCC ATCCATTACC 
87201 AGCAGGCAGA GAGTGTGTGA CTTTTTAAGC ACCCAGTGGT GTGTGTGGGC 
87251 CACATGAACA CTTGATTAGG AGTTCAGGTC ATCATTAAAG TCACAGAGAC 
87301 AGGATGTGCT GTACAAGAAG GACAGCTGTT ATCGTAATAG TTAGAACAAG 
87351 GCACAAGAAT CCCTCATCTG ACATCCTCCT GGGTGTTTTC TATCACAACT 
87401 CCCACCTGCT GCCTCACTTC TTATCATATC ATTACTTAGA AAAATGATTG 
87451 TAATGTGACT TGAGGTGAAG CCTTTTGAAT TTTCATTAAA ATAAAAAAAA 
87501 TCATTATAAA AGAAAAAAAC ACATTGTATA TTCAGTGACA CCTCTCTCAC 
87551 TTTTTCTTGA AAAGGACATG AATAATTTGG GAAAAATAAA TTCATTATGT 
87601 CATACTGGGA TTTCCATATT AAATTATATT GTTTTGAAAC CATATGGAAC 
87651 TGCCATTTAG ACAGTCAAAA AGAGTTGAAA ATTAGCAATT TCATATGGTT 
87701 CAACTTAATA AGAGCTGTTA CTCATTGATC ATTGTTTGTG TCAGAAAATC 
87751 TATGAAAAAA TCCTTATTTC CTTATTAATC TTCATAATCA TCACAAGTAG 
87801 TGATAGTAAA TCTCCATCAG TTCCCATCTT CCTACCTTTG CCAAAGCCGC 
87851 CCCACATGGA CCTTCTTAGG CAAATGAAGT TTCAAGATCT GTCCTCCATC 
87901 CCATTCCCCG CCCTGCACTT CCTACCCTAG CCCACACTTA GAATTGCCTC 
87951 TCCTAAACTC TATTTTATGG AAATATTTGT GATGGTACTG CCTAAGACAC 
88001 AGGCGTTGTT ATCGCACACG CTTAACAGAT TGGGAAATGG AGGCTTAGGG 
88051 AATCAGGAAG GAAGACAATT GGAGCAGCAA TTAGGTAGGA TTTGAACCCA 
88101 GGTCATCTTG GCTTAAAATT CTGAGCTTCT TCAACAATTG CCATAGTATC 
88151 TCTATGCTTG ATGACTCAAT CACCTGATAT AAAGCACTGC GAAGGTGACC 
88201 TAACACATTT GGGCCAAATT GCTAGAATTA ACCTTATTCA TATAGTCATG 
88251 ACAGATCAAT AGAAATATTA CTTTGTCCTT GTCCTTATGT AATGTCAAAG 
88301 ATCGATGAAA CCTCAGAAAT GACCTGATGG TTCAGTGTCC CTAGGGGAAT 
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88351 TTGCTACAAG TGAAATATTC CGGACCCCTT CTCCAAAGAT TCCAAATCTA 
88401 AGTCAGGGAT AAGTCCACAA AGATTTACTT TTCTATAACC ACCTCAGGTG 
88451 AGCCCAGTGA GAAACTTCAG GAAAATCTTG GATAAATGTT AACCTAGTCC 
88501 AGTTGCCTTA TTTTATTTAT TTTTAAAAAA TCTTTTTCAC TTAATATTTT 
88551 TTTCTTCAGG TCAGATGGGT AATGTGCCCA TTCGAGGGTG GCACATCTCA 
88601 CACACGTGCA TGAACACCCG ATCATCTATG CTCATGAAGT ACAAAAGGAT 
88651 TGATTGCCTT ATTTTATAGA TGAAGAAAGT CACTTGAGAT GTGAAGCTAT 
88701 TTAACTTACT GCTCTGTTGG TACCATTGGT ACTAGGACTA AATTTAGGTC 
88751 TCTGAACTCA TAATCAAGTG GACTGATTCC TAAACTTTAA CCACAGACCC 
88801 CAGAAATGCA GGTGTGTGGT AAGCGTTATT AAAGGTTAAA TAAATAACAT 
88851 GAGTCACACA CACAGTATTT ATCAACAAGC ACAGCGATGG CCTTTTGATT 
88901 GAAAGTGGAT CTGTTATTAG GCCTCCACTT GCCAGCTAGC CTAGCTCAGC 
88951 TGGGGACACC AGAAGTCAGT TCAGGCCTAA AAGTAATCAA ACTGAAGACT 
89001 GGTTGAAACT TCTAATGAAA TCTTCAAACA GAACCTTCCC CAGAAGTCTA 
89051 GTCTAGTTGT TCCAAACCAG ACTAGTCTGG GTTGGAAATA CCAACTTCAG 
89101 GTTTTTAGAG TAAACATGTA AACAATGTCC TCTGTAGAAA TGGACCTAAA 
89151 AGAAGCCTGA TCTTTGGACC TCCTAAGGGA TGCTATGTGT TTGGACAAGG 
89201 TCTTGGATTT ATCCAATCAA ATGCTTACTT GAGTGGTTTT GTAGTAATTC 
89251 CTTGTATGTG GTGAAAAGGC GGAGTGAACA AGAATCTCCA GATCAGAGAT 
89301 ATGGGGAGGG GACAGGAGAG TATAATGAAT TCCCTCCTAA TACACACTAG 
89351 TACCATGTGG ATGGCAGCTG TGAGCTCCTT GCACTTGGTG AAATGGTCAC 
89401 CTGGTGTGTG ATGCAGTACC TGCACCTCCA TGCAAGACCG TAGACGTGGG 
89451 GAGCCTCTAC TTCCATCTGG GAGCATGCTA TATTGCCTCT GTCAATAACT 
89501 ATCTCTATAA ATTTTAAAGT TTCGGTTCAG ATATAACTTT TGCATCCTTT 
89551 ATCTGTGGCT GTTGGTGTCT TTCCATCAAT CATGTCCCAT GGCCCTGTTG 
89601 TGATAACCAT GCACCCACAG TCCTCAGTTC ATGTAAGCAT ACACAATGAA 
89651 TAGTAATGAA GATGATGCTT TGGACATGCC ACGTCTATTT CTGCAAGTCT 
89701 CTATAAATCC AAGTCAAGGT CCTAAGGTGC CTGCAGGTGC CCCACAGAAT 
89751 TCTCTTCTGA AGCCAGTTCT TCAGCCTCCA TATCTATTTA GTCTTTGTTT 
89801 CATCCTTCTC TGCCCTCTTG ATGTTGTTAG TCTATTTAAC AGGCATCTAT 
89851 TATTTAGGCA TCTAAGCCTC CATTCCATCT GGAAAGCTGT CTACGCTTTT 
89901 GTGAAGACCT GGATGATCCC TGTCCACTGC CCACAGTCAA GGGTGAGGGG . 
89951 CGGCTCAGCA GGCATATCCC TGCTCTTACC TGCTGCCTCC CTTCCCTGCT 
90001 TGTTGGTTGA TTGTGCTCAG TGCCCATCGA GGTAGGGCTT CAGCATGCCC 
90051 TGATGTCCGC AATTCTCTCT CAACCAAGAC TTCAGCACAA CACAAGGAGT 
90101 GTTTTGTTTT GTTTTTTGTT TGTTTGTTTT TGAGATGGAG TTTTGCTCTT 
90151 GATGCCTAGG CTGGAGTGCT ATGGCATGAT CTCAGCTTAT TGCAACTTTT 
90201 GCCTCCCAGG TTCAAGCAAT TCTCCTGCCT CAGCCTCCTG AGTAGCTGGG 
90251 ATTACAGGTG CCTGCCACCA CGCCTGGCTA ATTTTTTGTA TTTTTAGTAG 
90301 AAATGGGGTT TCACCATGTT GGCTAGGCTG GTCTCGAGTT CTTGATCTCA 
90351 GGTGATCCAC CCGCTTCAGC CTCCCAAAGT GCTGGGATTG CAGGCATGAG 
90401 CCACCACGCT CGGCCAGCAC GAGGAGTTTT GAGGTATATT TAAAACCAGA 
90451 AAGAACTAAT ACAGGTGCTC ACAGAATCAA CTGTAACATG ATTTTATCTA 
90501 AAAATCTGCC CTTTTCAATT CCAACATTTG ATTTTCGGGC AAATCAGTTA 
90551 TTTCTTACGA AATTAAAAAC TGAATTCTCA TAGGATTTCA GAAATGACTC 
90601 TGATTTCAAA GTGCAAACTG CTTGAAAGCC TGAACTAACC ACTTAACATG 
90651 AACTACATCC CAAAACTCTA ATTCAGAAAT ATTATGTTAT CTCCTTTGTA 
90701 AAAATCTTCC TGTGGTGTGG TACTTAGCAC TGGGGTCACT TCTGACCACC 
90751 AATATGGAGA AATGAAACAT TTTTTATGTG ATTATCTTCT TTTACTCATC 
90801 CTTGCTCTTA AACGCTGTTT CTCTTTGATG CCCTATGTTT ATTTTAGATC ' 
90851 ATTTTCCAAG TCTTTTAATG TTTTCTAATC CTGCCCCTGC CCTCCTTATG 
90901 TGACTATCTA AAACTAAGCA AAACAAAGCC CACCTAATTC ATATTAAAAT 
90951 GTATTGCAC5A CCAATGACTT ACCCATATAT CAAGGAATTA TTATAGAATA 
9W01 AAATATATAA ACATAAAGAA CTCAGATCAG ATAGTCTTGG ATGGATTATA 
91051 AGGCATTATT TTGATTTTAT GTGGCCTGAC ATGATTATTG TGAGCACTAA 
91101 TGATCATTTT TCTAATATTG GGTGACAACA TGATTACTTG GTTTTGTTGT 
91151 ATGCATCGTA AACAGATCCT TTAAAAAGCA AAACCTTGCA CTGACAAAGT 
91201 GAATTAGACA GTTTGTAAGT ATGAAAATAA AGAAACATTT ATATAGCTTA 
91251 AAATATCCCT AGTAAAGAGC CTATTTCTGA GAGAAGGGTA AAAATTTTTG 
91301 AGAACAGAAT TCTTTGAAAA ACATAATCGT GCTAGGGAGG AAAGATGCCA 
91351 GATTCTCTGG AAT.TTTTGAA CTCAGATATA AACTGACTGA TCTAAATATA 
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91401 GGGAGTTACT TTGTAAATTG AAAACGCATT AGGACATTTT ACCCCCACTC 
91451 CCATGGGAAT GCCTTGTCTG GAATAGATGT GCATAGGGCC ATATTTTGTT 
91501 TGGTTTGAAA GGAGCGTGAA GATTGAAGGG CATTTTGGTC CTGCTTTGAT 
91551 TATTGTAATG CCAACACACT TTTGTTTGAT GCAAGCCTGG TGACTCCTGG 
91601 CGAGAAGGGA ATTATGCTTG TGGAACGAGA GATTTTCTGG ATGAAGAAAA 
91651 GAACACGACA GTCTCAGTTT TCTGTCACAT TCTGAATTAA ATGTAAAAAT 
91701 ACATGTTTGT CCTTCCTAGA ATGCTGCATC AACTCTAGGT GATTTAGATA 
91751 CAGTTTAGAG AAGACCCTTC AAATGCCAAT GACCTTCCAG AATGCATTAC 
91801 ATCTTAATTT AGAGTAACAC TAGCAGACAT GGAGAAAAAT TAGTTGGGTT 
91851 TACAAAATAG CCAAACCAGG TTACTGGGAT CCTGAGGGGC AACAAAAAGC 
91901 CCTGGCTTTG CCTTCATCTT AGTTTACACA ACACATTCCA GTTTCATGGA 
91951 TCTATAAATA CCATTTGACT TTCTCAAATA ATAAATTAGC GGCTCCCAAG 
92001 AGGAAATGAA AATAAAAAAA GTCAAGTGAT TCTTTTGCCT CAGCCTCCTG 
92051 AGTAGCTGGG ATTACAGGCA TATGCCACCA CACCGGGCTA ATTTTGTATT 
92101 TTTAGTAGAG ATGGGGTTTC TCCATGTTGG ACAGGCTGGC CTCGAACTCC 
92151 TGACCTCQGG TGATCCACCC GTCTCAGCCT CCCAAAGTGA TGGGATTACA 
92201 GGTATGAGTC ACTGCGCTTA GCCTGAAAAT TAAAAGATTT TGAAGCTTCT 
92251 CATGGAGAAA GAATTATAAA AGTTCAAAGC TAGAAGGGGA AAAAAGAAAC 
92301 AATACATCGA TTAGATGCAA CTAGCACAGA TCAGAACCAA TTCTTGTTTA 
92351 ATTCTCCCAC CACCCTCCAG CTTCATGTTG CTATCCTCCT CTGGCCAATG 
92401 AACCTGAAGG TCAGAGTAGA TTTCCAAAAG TCGCAGAGCT ATTAAGCATT 
92451 CCAGACGACC CTTCATTTTC TACTACCTCT ATCTGATAGG ACACAGAATT 
92501 AAGAGAAACT CCCAGGGCAG GGTCAAATAT TTCCACCAGG ACTGCCGAGA 
92551 GCCTTGGTTT CCTCTAAAAC AAATACACTG GCTTTTCACA GCCAAACAGA 
92601 CCTTGTATTC AGGGCTAAGA GATGAGTTCT AAGTATCTGT GTGGTCCCAA 
92651 GCATACCGTT TTCTGGGTGG CCCACTCAAC ATGACAGAGC CTCCAGTGAA 
92701 GTTTCTCTCC TTTAATTAAT TTCATTAGTT TTGTCTGGTA GTCTAGTTCT 
92751 ACCCTGTGGA CTATTTATGG GGTCCTGGTT CCTGTCTAAC TCTTTCCCCT 
92801 ATCTCATCCC CCAGGGACCC AGGGCCTTTG CTTTGCTAAT TCTCAACCAC 
92851 CCCCACATTT CCACTGAAGA GCTCAGAATA TCTTGAGCCC TTGTCTGAGA 
92901 ACCTCAAATA TCTGAATGTC TTCTCCTCCA ATCTCTGTCA TTTAGTCCAC 
92951 TATGCCTGAA AACTTCTTAA TTCCTTTCAG AAAGTAAGAC CATGCATGTT 
93001 TGAAGACTCT GACCTTTAGA AATTGCATCC TTATATCAAA CCCAAATCCT 
93051 TCTTCCTGTG GCTATTACCT AAGTCATCCT GGTTCTTAAA TTGACTTTTT 
93101 TTTTCTTTTT CTGAAGGCTC ACAGCAAGTA GGTGACCAAG TTAAGCGCCA 
93151 TTTCTTACAG CAAATGAAAA ATATCTGGAA TTGATTCAAG TTCCTAGGCT 
93201 CAAGATGACA TTCATTCATT TACGCAAAAT ACATTTTTTT GAGTGACTAA 
93251 TAGTACCCAC TGTCCACATC AGTGGGCAAA AGCCATGAGA AATAAGTCTG 
.93301 ATCTCTCTTC CCTAGGAAAA AACCTTTAAA TCGTCAAAGA AGACAGTTAT 
93351 CTTTTGACCA CTGAGTCATC TCCAGGCTTA CAGGCTTCAT GTATCTAGTT 
93401 CTTTCTGTCT GTCIGTATGAC CTCTTCTTTT AGAGGAGGTC TCATTAGAAT 
93451 ATAAGTACCA AAAAGCTATG GACTACATTG TGTTCCTCTT TACATACACT 
93501 GTGGCTAGCA AATTGTTAAT GCTTAATACG TAGCTTTAAA AATAATATGA 
93551 ATACATCATG ATTATTTCTG GAACTCTTCT AATCTGAGCT GAAATTTGAA 
93601 CTGAGTCTCT TTTAGCTAGA ATTTACAAAC TTACTTCCTG AATGAAATGG 
93651 AATCAGTGAA ATAGAAATCA TTTTACATTG AAAGGGAAAA GATTGATCAT 
93701 AACGTGATGT TTTAAGCAAA CAACAACAAA ATTAGAACAT AAATGGTCTA 
93751 ATCAGGTCAG AAGTAGGCAT TACAAGAAAA AGTAAGCTTA CAACTGCAGG 
93801 CAAATCAGAG ATTCTGGAAC ACATAGGTAT TAGGCTGATT TTGCATTGCT 
93851 ATAAAGAAAT ACCTGAGCCT GGGTAATTTA TAAGAAAAGA GGTTTAATTG 
93901 GCCCACAGTT TTGCAGGCTT TACAAGAAGC ATGGCACTGG CATCTGCTCA 
93951 GCTTCTAGGG AGGCCTCAGG AAGCTTACAA TCATGGCAGA AGGTGAAGGG 
94001 GGAGCAGGCA CCTCATACGG CGAAAGCAGG AGCAAGAGAG AGAAGTGGGG 
94051 GCGTGCCACA CACTGTCACA TGACCAGGTC TCACGAGAAC TCACTATTGT 
94101 GAGGACAGCA CCAAGGGGAT GGCGTTAAAC TATTCGTAAG AAATTAACTC 
94151 CCACGATCCA ATCACCTGCC ACCGGGCCCC ACCTCCAATA CTGGGGATGA 
94201 CAATTCAACA TGAGAOTTGG GTAGGAACCA ATACACAAAC TGTATCAATA 
94251 TCTAAAATAA TCAATGAATC CAGGAGCTGG TTTTTTGAAA AGATCAACAA 
94301 AATTGGTAGA CTGCTACCAA GACTAATAAA GAAGAAAAGA GAGAAGAATC 
94351 AAATAGACGC AATAAAAAAT GATAAAGGGG ATATCATCAC CCATCCCACA 
94401 GAAATACAAA CTACCATCAG AGAATACTAT AAACACCTCT AGGCAAATAA 
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94451 ACTAGAAAAT CTGGAAGAAA TGAATAAATT CCTCAACACA TACACCCTCC 
94501 CAAGACTAAA CCAGGAAGAA GTTGAATCTC TGAATAGACC AACAACAGGC 
94551 TCTGAAATTG AGGCCATAAT TAATAGCCTA CCAACCAAAA AAAGTCCAGG 
94601 ACCAGATGGA TTCACAACCA AATTCTACCA GAGGTACAAG GAGGAGCTGG 
94651 TACCATTCCT TCTGAAACTA TTCCAATCAA TACAAAAAGA GGGAATCCTC 
94701 CCTAACTCAT TTTATGAGGC CAGCCTCATC CTGATGCCAA AGCCTGGCAG 
94751 AGACACAACA AAAAGAATTT TAGACCAATA TCCCTGATGA ACATTGATGC 
94801 AAAAATCCTC AATAAAATAC TGGCAAACCG AATCCAGCAG CACATCAAAA 
94851 AGCTTATCCA CCATGATCAA GTGGGCTTCA TCCCTGGGAT GCAAGGCTGC 
94901 TTCAACATAC GAAAATCAAT AAATGTAATC CAGCATATAA ATAGAACCAA 
94951 CAACAAAAAC CACATGATTA TCTCAATAGA TGCAGAAAAG GCCTTTGGCA 
95001 AAATTCAACA GCCCTTCATG CTAAAAACTC TCAATAAATT AGGTATTGAT 
95051 GGGATATATC TCAAAATAAT AAGAGCTATC TATGACAAAC CCACAGCCAA 
95101 TGTCATACTG AGTGGGCAAA AACTGGAAGC ATTCCCTTTA AAAACTGGCA 
95151 CAAGACAGGG ATGCCCTCTC TCACCACTCC TATTCAACAT AGTGTTGGAA 
95201 GTTCTGGCCA GGGCAATTAG GCAGGAGAAG GAAATAAAGG GTATTCAATT 
95251 AGGAAAAGAG GAAGTCAAAT TGTCCCTGTT TGCAGATGAC ATGATTGTAT 
95301 ATCTAGAAAA TCCCATCGTC TCAGCCCAAA ATCTCCTTAA GCTGATCAGC 
95351 AACTTCAGCA AAGTCTCAGG ACACAAAATC AATGTACAAA AATTACAAGC 
95401 ATTCTTATAC ACCAATAACA GACAAACAGA GAGCCAAATC ATGAGTGAAC 
95451 TCCCATTCAC AATTGCTTCA AACAGAATAA AATACTTAGG AATCCAACTT 
95501 ACAAGGGACG GGAAGGACCT CTTCAAGGAG CACTACAAAC CACTGCTCAA 
95551 TGAAATAAAA GAGGGTACAA ACAAATGGAA GAACATTCCA TGCTCATGGG 
95601 TAGGAAGAAT CAATAGCGTG AAAATGACCA TACTGCCCAA GGTAATTTAT 
95651 AGATTCAATG CCATCCCCAT CAAGTTACCA ATGACTTTCT TCACAGAATT 
95701 GGAAAAAACT ACTTTAAAGT TCATATGGAA CCAAAAAAGA GCCCACATTG 
95751 CCAAGTCAAT CCAAAGCCAA AAGAACAAAG CTGGAGGCAT CACGCTACCT 
95801 GACTTCAAAC TATACTACAA GGCTACAGTA ACCAAAACAG CATGGTACTG 
95851 GTACCAAAAC AGAGATATAG ACCAATGGAA CAGAACAGAG CCCTCAGAAA 
95901 TCATGCCACA TATCTACAAC TATCTGATCT TTGAGAAATC TGACAAAAAC 
95951 AAGCAATGGG GAAAGGAGTC CCTGTTTAAT AAATGATTCT GGGAAAACTG 
96001 GCTAGCCATA CGTAGAAAGC TGAAACTGGA TCCCTTCCTT ACACCTTATA 
96051 CAAAAATTAA TTCAAGATGG ATTAAAGATT TAAATGTTAG ACCTAAAACC 
96101 ATCAAAACCC TTGAAGAAAA CCTAGGCAAT ACCATTCAGG ACATAGGCAT 
III 5 * GGGCAAGGAC TTCATGTCTA AAACACCAAA AGCAATGGCA ACAAAAGCCA 
96201 AAATTGACAA ATGGGATCTC ATTAAACTAA AGAGCTTCTG CACAGCAAAA 
96251 GAAACTACCA CCAGAGTGAA CAGGCAACCT ACAGAATGGG AGAAAATTTT 
96301 TGCAATCTAC TCACCTAACA AAGGGCTCAT ATCTAGAATC TACAATGAAC 
96351 TCAAACAAAT TTACAAGAAA AAAACAAAGA ACCCCATCAA AAAGTGGGCG 
96401 AAGGACATGA ACAGACACTT CTCAAAAGAA _ GACATTTATG CAGCCAAAAC 
96451 ACACATGAAA AAATGCTCAT CATCACTGGC CATCAGAGAA ATGCAAATCA 
96501 AAACCACAAT AAGATACCAT CTCACACCAG TTAGAAAGGT GATCATTAAA 
96551 AAGTCAGGAA ACAAACAGGT GCTGGAGAGG ATGTGGAGAA ATAGGAACAC 
96601 TTTGACACTG TTGGTGGGAC CGTAAACTAG TCCACATTGT GGAAGTCAGT 
96651 GTGGCGATTC CTCAGGGATC TAGAACTAGA AATACCATTT GACCCAGCCA 
96701 TTCCATTACT GCGTATATAC CCAAAGGATT ATAAAACATG CTGCTATAAA 
96751 GACACATGCA CATGTATGTT TATAGCGGCA CTATTCACAA TAGCAAAGAC 
96801 TTGGAACCAA GTGAAATGTC CAACAATGAT AGACTGGATT AAGAAAATGT 
96851 GGCACACATA CACCATGGAA TACTATGCGC CATAAAAAAT GATGAGTTCA 
96901 CGTACTTTGT AGGGACATGG ATGAAGCTGG AAACCATCAT TCTCAGCAAA 
96951 CTATCGCAAG GACAAAAAAA CCAAACACCG CATATTCTCA CTCATAGGTG 
97001 GGAATTGAAC AATGAGAACA CATGGACACA GGAAGGGGAA CATCACACAC 
97051 TGGGGTCTGT TTTGTGGTCG GGGGAGCGGA GCGGGGAGGG ATAGCATTAG 
87101 GAGATATACC TAATGTTAAA TGATGAATCA CTGGGTGCAG CACACCAGAC 
97151 TGGCACATGT ATACATATGT AACTAACCTG CACGTTGTGT ACATGTACCC 
97201 TAAAACTAAA GTATAATTAA AAAAAAAAGA ATTAAGGATA TCAGGAGCTT 
97251 TAGGTTTATT TATTCAAATG AAGAATTTCC CAGAAGGTGA TTTGATTCTT 
97301 AGCAGCAGCA TAGCTTAAAG CTCTCACCAG CAGAACCCAC CATTTTCCTA 
97351 TGGATATTTT TCAAATATGT ATTTAGGTTT TTTTGGGTTT TTTTTTTGGG 
97401 AAACGGAGTC TCTCTCTGTC GCCCAGGCTG GAGTGCAGTG GCGTGATCTT 
97451 GGCTTACTGC AAGCTCTGCC TCCCAGGTTC ACACCATTCT CCTGCCTCAG 
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97501 CCTCCCAAGT ACACATTTAG GTTTTTACCT TTGAACTCCT AGTACTTTGT 
97551 GCCAATGCCA ATTAGTTGTG AATTTACTCT TGTACTATCC TCTGGCATCT 
97601 CTTAAACTAT CATTTCCTTT GTGATATTTT ACTTTTCTTG CATGTATTCT 
97651 TTGTTTCTTT AAGCAGAGTA AAGTCATTTT TTGTAAGAAG TGCAATGTCT 
97701 TTCACTTCTT TATGTATCTC ATGAAGTCTG TAAGAATGCT ATTCTACTAA 
97751 TCAGACCGGG CAAACATGCT GTGACACATA GACTCACCGA TACGTAATGG 
97801 CTGAAATGTA ACAGAAGTTT AGATCTTACT CACTTAACCA AGGCTGGCAT 
97851 TCTTGTTCCT GTTCATGAAC AGCTTTTTGC CACGTGGAGA CTCAGAGCCA 
97901 GCTCCTCTCA ATGTGTGGTT CTTCCTATCT TAGGATTTTG TTGTTTTCTA 
97951 CATCGAGCTT GGAGAAGCTG GAAGAGCCTG AAGAGCCAAC TGGTCCTTTT 
98001 TTTTTTTTCT TTTTTTTGAG ATGGAATCTT GCAATCTCAG CTCACTGCAA 
98051 CCTCCGCCTC CCGGGTTCAA ATGATTCTCC TGCCTTAGCC TCCCAAGTAG 
98101 CTGCGATTAC AGATGCCCGC CACCATGCCC AGCTAATTTT TGTATTTTTA 
98151 GTAGAGACAG GGTTTTACCA TGTTGGCCAG GCTGGTCTTG AACTCCTGAC 
98201 CTCATGATCC ACCTGCCTCA GCCTCCCAAA GTTCTGGGAT TACAGGTGTG 
98251 AGCCACTGTG CCTGACCCAT GGTCCTAGTC TTAAGACTTC GCTTACAAAT 
983 01 GCAAACATTA TTTCCAGTCA GTGTTATTGG AGAGATGAGT CATATGGTCA 
98351 CACTAGGGGC ATAGGAGACT GGGGATGTGA TATAGTCCTA TCCTGGGCAA 
98401 CCATTTCCTA GACACAGATT TATACCCTGA AATACAAAAC TGGATTCTGA 
98451 TGGACAGCTA GCTTTCTCTG CTAAGCACAA AGTCGAGTTT AACAAATATT 
98501 TGTTGATCTA TTTATATTAG CAGTAAAGAT CAAATTGCTT TTATTTCTCC 
98551 TTTGAACAGT GCTATTAAAT ATCTCGGAAA TTTTAGAAAA CTCTGTAATG 
98601 ATGTGCAGTA GATTATGCAT TTCTTCAACA ATGATTTGAA AGATTGCTGA 
98651 TGTTTATCTA GGGAGATGTG AAGCAATTTA ATATTATTAA TTACTAACAA 
98701 ACGGATAAGA TTTACAAGAA TTGTCAAAGC TTAAAGCTCC AAATAATCAT 
98751 TTCTCACACA AGAAAATTTA TTATGCCATT ATGCTTATTT CTATCAGCAT 
98801 TCCTAAAGTG CGATCACTGT TTGCTATTAA ATTTTTATTG CACATTTAAT 
98851 TTATCTTTCC CTCTAGTGGG GAAACATACT AGAGAATTCA TTTGACACTA 
98901 GTATCATCTC CCCTTCTTTT TTTTCTCACC ACTTCTGCAC CCCCTGAAAT 
98951 AGATAGGAAT ATATTTGTAT CTTGGTATTA AGTATACAAG AGCTGAAGAG 
99001 AACAATGCCT ACTGCTGCCG GAAGGGAATT ACCCAGCCTG AATTAGCTTT 
99051 ATGAAGAACC AAGTGGACTT CTGTTATAAA TACCCACTTT CGGTGGTTGA 
99101 TAAAGAATTC TGGGAAAAAA TCCATTTATA AGCTAAAGCT TGGTTTTCAG 
99151 CGATTAAAAC AGAAAGCAAT TTTGGTTTTC AAGTATTAAA GGCAGACTGC 
99201 GTGTTAAAAG TTGGGGTTGG GGGGCTATTT AGTTTATAAA GTGCTGTATT 
99251 GAAAATGACG TTTTATGTTT CTGAAGGTAT GCGTTTTATG TGCAAAGCTT 
99301 GACAAACTTG TTCCCCATCC CACTCCCTAC CCTAAGAGTT GAATCAGGAG 
99351 GAAAAC AGTG . ATATAGGATC TGAAAACGCG TTTGTTGTCA CCAAGGGCTT 
99401 TCTCTTAACA GGGCCAGATT CAGTGGAAAC ACGATCAGCC CTGATACAGT 
99451 GGAAGCCTTC TCTTGATTTT CAGAAAGATT TCTAGCTCTT GGGCCTAATT 
99501 CACTAAGACA GAAGTCAGAT TGTGTTATTG ATTAATCTGA AGCTTTAGTC 
99551 TCTCACAGGA TTTGATAACA CACCGCAACT GTTTTCTTCA CACTGAACAA 
99601 CTACCAACGT TTTGTTTGTA TTTTGGGTAA GAGGCTGAAC TTGACAAAAT 
99651 ATCAATGATC CCATTTCTAG TGCCTCAGTG AGGCTATATT TTTTTCCTTT 
99701 GACTTAACAG TGCGGCCTGA ACATCTGGTC TACTGAACTC AGGAGGGAGT 
99751 GCATAGAAGT CTTACCTCTC CCGGGAAGAA TCTGCGAATC CTTCAAAGAG 
99801 TTGTCATTAC AACTTCAATA CACCCTTATT GTCTACTCCT CTAAGAGCAC 
99851 TTTTCCCCTC CATCCCACCC TATGTATTTC ATCTACATAT TTGCCAATGA 
99901 CTATGTTTGT CTGTTTCCTT ACAGACATTG ACACGAATGC CCTGAAAGAC 
99951 AATGAAATTC ATGCTCCAGA AGAAGTAGGA TTTAGAGCTG ACATTTACAG 
100001 AGTTTCCACA GTGGATAGTA GATCAATCAG GGCTTTAGGT GTTGCTGAGT 
100051 AATTTTTCCA ACTTTCTGTT GCTACCTGGC TGTAACAGCC AGAAGAGGAA 
100101 GAGTGACAGC CCCACCAAAA TTGAGCTGCC TTCCTTACTC ACCTTCTGAC 
100151 ATCACCGTCC TGGAACCTCA TGCATTACCC -PCAGCCTTGG CACATTATCT 
100201 GCCTGGCTTC TGCATTGGCC ATGGCATACT CATGCAGCCA TACGCTCTAA 
100251 AGTCTCCTGT CAGCCTCGGA ACCACCAGTG GGGCAAGCTC TCCCTTCCAT 
100301 TACTGATTGC TATAGTATTA ATAGAATGCC CGGGCTCCTG GCATCCCTGG 
100351 CCTAAGGGAC TATACCTCTC TCCACGACCC GGTGCTTAGT AGGTGCTCAA 
100401 TAAATGTTGT CTCCATGTTG TTTAGCACTT GGATTGATCT GTTCCAAAGG 
100451 CGCCAGCCCC TTCCCTTGAA CTGATTTCCA AGTTTCCTCC GCTGTGTGAG 
100501 AATTAACAAT AGCAATGCCA TTTTTCTACC TTGGTGACTG TGTTTCCTAC 
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100551 CTCGGTGACA GTGTTCCCAA AAACAAAGCA AAGGTCAACC TCACTCATAC 
100601 CTTTAACAGA GCATAAGAAC CCTGACCACA ACAAAGCCTG ACTGGCCCTG 
100651 ACTGCATCCC TCCCCAGCCC CTTATGGTTC CAACCTGCCA GCGCTTCCCC 
100701 AACCCCTCCC CTATAGAAAC TGGATTATAA AAACCTCAGC TTACAGGCAA 
100751 TCAGGATCTC AGCCAGATTC CTGGCTGGGA ACTCTCACAG GCTTATTCTT 
100801 GTAAATACAC CTATTTGGCC ATTGAGTTGC CTTGTCTCTC ATTTCCTTCC 
100851 TTTAAAAATC TCCCTAACAC TCTGAGATTT TTCAAGACAG ACAGACACAC 
100901 AAACACACAC ACACACACCC CACATCGGTC TTAAAACTTC CCACTGGTTA 
100951 TAAATAACTG CAGTGTATTC GGACTAGTTC AAAGTAAAGG CGATGATAGG 
101001 GAAGGAGGTG ATTGGTTTAA GAGGAGAAGC AGCCACACAA AGGGGCAGGA 
101051 AGTGGTGCAT CCGTAGCGGA GACTACTCTC TCCATCATTC ATGGGGCCTG 
101101 GGAACTCCCT TCTCAGGCTT TCTCAACACA TTTGTTTCTT TTCTTCTCTG 
101151 AAGACCAACT CTCTCTCTCA AGCACACCCT CTCTGCTTCT CAATTAATCT 
101201 AACTTGCAAG TAACTTTGGT TTGTGTAATA CTGCCTGAAG CCTCGACTCT 
101251 ATACATGAAT GTATGGCTCA AATACACTTC ATCAACTGTG TTCAGTTTTT 
101301 ATGTCCTTTT GTTCAAATGG TTAAATTCAG GCAATGCCGC AAATCTGTCA 
101351 CATGCCAACT GACAGTCTTC TGTATTGAGG ACCAGTTAAA ATTCATTGAT 
101401 ATTATGAAAC TAGTACTGTC ATTTACTAGG AATAGGTGAT TTACTTTATT 
101451 ATTTATTTAT TTATTATTAA ATTTAAATCA TGAATAGTTC CATTAGTTTA 
101501 AAATTTATAA AATAAAAATA ATAAGCACAG AAAGCAGGAA AGATTTTTAC 
101551 CAATGTTTTC AAGTGGAAAC AAAATAAAAG AATTTTTTTT CTGTTTTTCA 
101601 TGTTTCTGAA CAATTTTCCT GGGTAAAATG TAATTTATAT TCTGAATTTA 
101651 AATTAGCTAT ACAATTTGTA TAATAGAAAA TGTTTTTCTC TTCTTTTCTA 
101701 AATAAATGTA GAATAAATAT TTAAGGAATA TTGATTGGAA AGGCCATTGA 
101751. ATGACAACTG TTCTCAGAGT GCCTGCATCT CTTATGCTGG CCACGGCTCC 
101801 CTTGACCCTC AGAGCAAGTT CTGCCACCAC ATTATCTCCA AAAGCTGTGT 
101851 TCAGGAATTT GTCCAGCTTC CCAGGTAAGG GCATTTAACC TCGGCTGCAC 
101901 TGGACTGCTG GCTGCGTCTG TCCCCTGGTC AAGGGAAGCC TTGGATGGAA 
101951 GAGCGGCTGT CACAAAGAGG GGCAGGACAG GCATCGGAAG CAGCATTTGT 
102001 CTTTGCCTCT TTGTCTCTCT GCCAGGATGA TAACTGGCAA CCACGTAGCC 
102051 ACAGAGGACA AGAGACCCCT GCATCTGAGC AGAGGCTGAT CCTTTACTAG 
102101 GCTGTTTTGC TTTTTAATGT GATTTCCTCC CAGCAAGCCA CTTCCCTGCA 
102151 GTCATGTAAT GCCATTCTGA TTTGGTTTGG CAGCGTTAAT CTAAGGAACA 
102201 TGTTTAATAG CTCAGGCATG GTTATTTGCA TGAATTTTTT TTTTTTTTCT 
102251 GATAGCCGCT GTTCCTTTTT AACACTGGGG TCCTTGAGGC TGCCAAGGCC 
102301 TTGCTTTAGC CTGGAGCCCA GATTTCTCCA TGGAGCTGCT . CTGTGTCTGC 
102351 CGTTGCTACT CACCCTGTAT TGAGCACTTG GAGGTGTTCT AATTCAGGAC 
102401 AACTGTCTGC TGTCAGTGAG GAATTAGTGC ATGTGGCCCG ACAGTGGAAG 
102451 GCAGCAGCAC CTCTAGGCAA GCTGTTTTCC TTGCTAACTC ATTGTATGCC 
. 102501 TGTGGGTCAG ATTCCTTCAG TCCCCAGGGC AGTGATTCTC CAGGCGGGAC 
102551 CTGTCTCAGT GTGTAAGTGA TGGATTGCAT TGTTTGAAAA TTCGTGGCCG 
102601 AAGCTGCTGT CAGATGTACT GAGAGCAACT GTCATTGTGG AGGGAGAGTG 
102651 AGCAAAGGGC TCACCTTAGA AGCTGTACGC TTTCCTTTCC AGATTTTGTG 
102701 TCACTTGCTA TTAGCATCAA TAGCCCATGT CCATAGAACC TCTGCCTCTC 
102751 TTGACTTCCC TGATTCTGTC CAAGGCTTTC CAAGCAGCAG TTCATACACG 
102801 CAGTCTTGCT GCCCTCTTTA ATTTTCCCCT TTCCACAGAC TTAGTACCCA 
102851 TAGTCACCCT CAATTTCTTC TGTACAGCAC TACTCTCTGT ACAGCGTTCC 
102901 CTTCCTTAGC TAGAGTCTTC ATCCCACCCA GCAAGCACTG AAATCTAACA 
102951 GTGGGGAGCC AGGAAGAGGA CATTTTTATG CAAATAAGTA AATATCTACA 
103 001 CTTCATTTAC AGTGTGACTC TGAAATAAAG TCTTTTTACC TAAAGGAAGA 
103051 GAGTCAAACT ATTTTCACAA CTAGTCCTGC ATTGATTTCT CTGATGATTT 
103101 GCATGGGGGT TTCTCGGAGC ACTGCTACTG GGTACAGAAG TTCTTATCAC 
103151 TTGTAATTCC TCAAAACATG AAGCATGCCG ATGAGAAATT CGGTATAGGA 
103201 GAATAAAATA AAACAGACAA CAAAGAGAGG ATCATTAAAA TGCAAATCCCf 
103251 AGCTGCTCTA CAGGGTTCAT TTTGTCTCAT CTTCTCTGTG AAACCTTCAT 
103301 CTACTGCTCT AGCTCCATAC ACCGATAGGG TCCTCTTTAA TGTCTACAGA 
103351 TCCTACAACT GTTCTGTGAA ATATGTGACT AAAATTTGTA CAGTGGTAGG 
103401 TCAGTTTCTG GGTCTTTAAA AACTGCCAGC TTTTTCTCTA AATTGGGATG 
103451 AAAACCAAAC TCTAAAGTAT TTAAAGAGGC TTATTCTGAG CCAGTATGAG • 
103501 TGACCATGAC CCAAGAAAGA GTCTCAAGCC ATCCTGAGAA AGTTGCCTAA 
103551 GGTGGTCGGG TTACACTTTG 1 GTTTTATATA ATTTAGGGAG ACAGAAATTG 
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103601 TAGGTAAAAT CATAAATCAA TATGCAGAAG CTATATATTG GTTCAGCCTG 
103651 AAGAGGAGGG ATATCTTGAA GGGGCGGATG TGAGGTCATT AAGTGGATTC 
103701 AAAGATTTTC TGATTGGCAG TTGTTGCAAG AGTTAAGCTT TGTCTAAAGA 
103751 CTTGAAATCA GTAGGAAGAA ATGTTTTAAT TAAGATAAAA GAGGGTGTCA 
103801 AGGCCAAGGT TCTTGTTCTG AAGCCTCCAG TTAGCAGCCT TCGGAGAGAA 
103851 TAGATGGTAA ATGTCTCTTT TTAGAACTTA AATGTGTCAA ACTCTCATTT 
103901 AATTTCTCCT AGCTCTGGGA AAGGTCCGGA AAGGGAAATT CTGGCTGCAT 
103951 TAATGGAGAT TCTCTACAGA TCCAAATTTC CTCCTAAAAA GATCACTTTG 
104001 AAGAGCGATT TCAAGATATG TCAAAGAATA TATTTTGGAG TAAAATATTT 
104051 TGATTTCCTT TAGGGTCTGC TGTCTGTCAC ATGATGCTAT ACTAAAGTCA 
104101 TGATGGAATT TGGTATTGTA TTTCTACAAA GAGTCTATGT GGTCAGTCTT 
104151 ATGATCTCTG TTTTAACGTT AATGCTGGTC AGTTGTACCT AAACTCCAAA 
104201 CAGAGGGGCT ATAATGAGGC GTGTCCAACC TCCCCTCGCT GTCATGGCTG 
104251 GAAATTTAGT TTTTTCAAGT TTCTCTTGGG TCCCCTTGGC CAAGGGTTCA 
104301 TTCAGTCATT TGGGGGGCTT CAGATTTTAT TTTTGCTTTA TAGGGTGTCA 
104351 GGGCCTTTGG GGTGAAGGCC ATGTCTTATC TGGAAAATGC CTAAAAATGG 
104401 AGTAGATTGG GCCTTGAGCA GTTTTTCAGA GGTTCAGTTA CAAACTCTAA 
104451 AGCAAGCATG AGAAAAAGCA CATTATTGGA GTAAGTATGA TTGATGTTGT 
104501 TATGAAGAGT ATTATTGATT TATTCGCATC TCCAAATCAT TCTAGAAAGA 
104551 GAATTGAGAC AGCTTATCTT ATCTATCTTT CTATCTGTAT GCATACAATG 
104601 GGATAAATTG ACAAAATAAA CGTGAGGAAA TGGCAGCAAC GGGAAAATGA 
104651 AATCAGGAAA ATAGGATAAA TAGAGAGGTT TGATAACACA TAATGAACAC 
104701 CGTAAGACAC TATTCAACTG CTGGAGAGGA GCCCCAAATT TGGTTGTAAA 
104751 TTTACCAGCA GCCACAACAA AGGGGAAACA CAATCTGTTG CAAAAATTAT 
104801 GGGGCCTGTA AAATGAAAAC TCAGCTAAGG CAGGAAAACC TGTTCTTTCT 
104-851 GGTTCGGTGA GTCCAGTGAC CCAAGACTCA CAGACTCATG ATTCCTGAGA 
104901 AGAGGCACTG ACCTGGTGCT GTGCACAATG TCATTGCCCA CATTTTTAAA 
104951 AAGAGCCAAG GTTCTCATAG CATCCCCCAA AGCAGGCTAG TGGCATGCAC 
105001 TTCAACAGAT GTAGTTCAAT GAGATTCAAA ACTGTACATC ATGCAGTAGT 
105051 ATTTCTCTTG AACTGAGAGT GCAGAAGTGC TATGTTGTTT GTATATCAGT 
105101 GTTGCAGGAC TTTCTCCTTA ATTCAGCTAA AAACAGGGTC CTTGTCACCT 
105151 GACCATGAAG ATTAGGCTTG AAGACACGTT GAAGGGTGAG AAGGGCAGGG 
105201 TTTATTGGGT GAAAAGGATA AAAAAGGGAA ACAGGGATTC TCAGCAAAAC 
105251 GAGAGTCCTG ATTGAATCCC AGGGACCACC TCAGAATGGG AGGGGCCAGG 
105301 CTCCTCCCCA TTTGCAAACA GCAGAAACTT CTGGAGGTTC CACCCCTGTG 
105351 TGCACTCCTC TCAGTGTGTA GGCTGGTTGG AAGTTCTTCC AGGAAGCCCT 
105401 TGTTACTTGG CTGTCTCATC AGTAAGCACT GGTAATCCAT CCAGTGAAAG 
105451 CAGTTTTGAC AAACTGTCAC TAAACTTACA GATGTGTACC TCAGAAGAAG 
105501 ATAATAAAAT CTCTCTGAAA GAGGAAATCT GAATATGTAA ATACATAGGT 
105551 AGGAAAAATA GCTCATTTGA AATCGTAAAT ATTGTAAATA TGTAGGTAGG 
105601 AAAAATAGCT CATTTGAAAC AACTAAAAAG TGATTATAAT ATAAAGATTC 
105651 ATGACCACAT CTTTCATACT AGCAAATGAG AAAAACAGGA TCACATTTTA 
105701 TCTGTAACTG CATATCTAAA ATCTCAAAGG AAAGAATAAA GAGGAGAAAA 
105751 TGCTTGCAGG TGTCTGTGAA TCCACATCTA GGAAATTAGG TTAGGAGTTG 
105801 TATTTGTTTT GTGTTGATAA TATCTTGGTA GTGGTAGTGC TGGGTGGCTC 
105851 TCCCAGCTCT CAGAATGCAG GTGTGGTAGG GGCTGGGAAA GGGGCACAAC 
105901 ATGGAGAATG TTCATGAACC AGAGTGGGCT AAGGGAAATA GGGTATTTGC 
105951 TAAAGAGAAG GGGGAATAAG CCTGATGACA GGAGAGAGGG AGAGAAGGAG 
106001 AGTGGGGAAA AAGGAGGAAG AGAAGGAGAG TGGGGAAAAA GGAGGAAGAG 
106051 AACAGACAAA AAGAACGAGA AAGTCATGGA CGACTGAGAA TACAGAGAGA 
10 6101 AAGCAGGGAA GAAAAAAGAC ATTTCTGAAC CTTAAACAGT AATTACCAAC 
106151 TTGACTTAAT GGCAGAAAGC AGTTGAGAAT AAGAGGCTTC CTTTTCAGAG 
106201 AACAGGAGTT GAAGAGAAAT GAGAGGACTT TTCCCTTTCT CTTCTTAGTC 
106251 AGTCTTCTTT CTGCCTTACA GGTTTTTTCT GTGCCCAGAA ACAGTCCTCC 
106301 AGCAGAAGAG CAGGAGACAA GAGTTAGTGT CCCAGAAAAT GAAGCTGAGG 
106351 CCCAGAAAAG ATAGATGACC CATGGCCTTA TAAGGTCTAA TAAAGACTAG 
106401 GTTTCCTTTC ATGATCGGTG TGGCTCCACT GTAGGCTTTT ACGTAGGTGA 
106451 GCACTAGAAT TAGATTTTTA TTTCAGAAAT ATCCCTTTGG AAGTATTATG 
106501 GAGGGTTGAT CAGAGAAGGC AGTTGCCTTA ATTCTGCCTC CAGATTGTTG 
106551 GTGATGAGCC CTCTGATTTC TGGTGTTTCA TGCTACTCTA AAGTGGTTAG 
106601 CACTCTCAGT CTGTCCACAA TTTCTCCAAG AATGGTCTTA TACAAGGTAC 
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106651 AATGTTCACA GTGACAGACG TCAAAGAGCA ACGTGCCAGG CTACTTTCTC 
106701 TAATATACTT CTGACATCTC ATTTTTATTG AGTTATTCTC TCCTTGAAGA 
106751 CTTTCACTCC TCATTCCCTG TAGGAGATGG CTAACTCGTT GGCCCGCTCT 
106801 CCATGGCCTT CTCCCTCTGG TGTCATCTGT ACTTGAAGAA TTTCCCATCT 
106851 ATCTTTCTCT GACTCTCCTC TCTGTTCAAA TTAATCTACT TATTTCCTAA 
106901 ACAGGCTTGT TTTATCATTG TTCTTTTCTC TTCTTGAAAG GGGCTCTCCA 
106951 CTCCTCTTTT CTGAGCCAAA CACTACCTAA ACTTTAAGGT CCACATAAAA 
107001 TGTTTCATGT CTAAGGAGGC AAAACATATT AAGGTCTGGT CCCTGGGTTC 
107051 AATCTCATGC TTGCTGCTTG CCAGCTAGTA GTACAATGTC ACAACCAGAA 
107101 TATTGACCCT GATGTAGTCA AAATACAGAA TACTTTCATC ATTGCAAGGA 
107151 TCCCTTCTGT TACCATTTTA TAGCCATATC TATTTCCCTC CCACCCTAAC 
107201 TCTATCCTGG GATCTCTGGC AACCACTAAT CTGTTCTTCA TGTCTGTAAT 
107251 TTTGTGGTTC CAAGAATGTT ATATAAATGA AAGTATACAG TATGTAACAT 
107301 TGTGGGGTTA GCTCTTTTCA CTCAGCATAA GCTTCTGGAG ATTCATCTAG 
107351 ATTGTTATTA TGTCAGGAGT TCATTCATTC CTTTTTATTG CCAAATATGT 
107401 ATTGCATGGT ATGGATGCAC CACAGTTTGC TTAACCTTTG ACCCATTAAA 
107451 AGACATTTGT TTCCAGTTTA GCTGTTAAAA ATAAACTACT GTGAACATTT 
107501 GTGTACAGGT TTTCACATGA ACCTATGTCT TCATATCTTT GAGATAAGTA 
107551 CTACCCAAGC GTGTAACTGC TGGGTTGTTT AGTAGCCTGC TTTGTTGTTG 
107601 TTGTTGTTGT TTAGAAACTG CCAAACTCAT TTCCACAGTA GCTATATGCT 
107651 TTCATTTTAT TTTTATTTTT TATTATACTT CAAGTTCTGG GATACATGTG 
107701 CAGAACGTGC AGAACGTGCA GGTTTGTTAC ATAGTTATAC ACGTGCCATG 
107751 GTGTTTTGCT GCACTCATAA ACCTGTCATC TACATTACTT ATTTCTCCCA 
107801 ATGCTACCCC TCCCCTATCC CGCCACCCTC CGACAGGCCC AATAAGCCCT 
107851 GGTGTGTGAT GTTCTCCTCC TGGTGTCCAT GTGTTCTTGT TGTTCAACTC 
107901 CCACTTATGA GTGAGAACAT GCGGTGTTTG GTTTTCTGTT CCTGTGTTAA 
107951 TTTGCTGAGA ATGATGGTGT CCAGCTTCAT CCATGTCCCT GCAAAGGACA 
108001 TGAACTCATG CTTTTTCATG GCTGCATAGT ATTCCATGGC GTATATGTGC 
108051 TACATTTTCT TTATCCAGTC TATCATTGAT GGGCATTTGG GTTGGTTCCA 
108101 AGTCTTTGCT ATTGTGAAGT GCTGTAACAA ACATTCTTGT GTGTTTGTCT 
108151 TTATAATCCT TTGGGTTTAT AAAATGATTT ATAATCCTTT GGGTATATAC 
108201 CCAGTAATGG GATTGCTGGG TCAAATGATA TTTCTGGTTC TAGATCCTTG 
108251 AGGAATTGCC ACAGTGTCTT CTACAATAGG TGAGCTAATT TACACTCCCA 
108301 CCAACAGTGT AAAAGCATTT CTATTTTTTC ACACCCTCTC CAGCATCTGT 
108351 TGTTTCCTGA CTTTTTAATG ATTGCCATTC TAACTGGCAT GAGGCTGTGA 
108401 ATTCATCTGG TCCTTGTTTT TTTTGGTTGG TAGGCTGTTA ATTACTGCCT 
108451 TGGTTTCAGA ACTTGTTATT GGTGTATTCA GGAATTCAAA TTCATACTGG 
108501 TTTGGTCTTG GGAGGGTGTA TGTGTCCAGG AATTTATCCA TTTCTTCTAG 
108551 ATTTTCTAGT TTATTTGCAT AGAGGTGTTT ATAGTATTGT TTGATGGTAG 
108601 TTTGTATTTC TGTGGGATCA GTGTTGATAT CCCCTTTATC ATTTTTTATT 
108651 GTGTCTATTT GATTCTTCTC TGTTTTCTTC TTTATTAGTC TAGCTAGGGG 
108701 TCTATGTATT TGGTTAATCT TTTCAAAAAA CCAGCTCCTG GATTCATTGA 
108751 TTTTTTGAAG GGTTTTTTGT GTCCCTATCT CTTTCAGTTC TGCTCTGATC 
108801 TTAGCTATTT CTTGTCTCCT GCTAGCTTTT GAATTAGTTT GCTCTTGCTT 
108851 CTCTATTTTT TTTTTATTGT GATGTTAGGG TGTTGATTTT AGATCTTTCC 
108901 CACTTTCCCC TGTGGGCATT TTAGTGCTAT AAATTTCCCT CTAAACACTG 
108951 CTTTAGCTGT GTCCCAGAGA TTCTGGTATG TTGTGTCTTT GTTCTCATTG 
109001 GTTTCAAATA ACTTATTTAT TTCTGCCTTT ATTTCGTTAT GTACCCAGTA 
109051 GTCATTCAGG AGCAGGTTGT TCACTTTCCA TGTAGTTGTG TGGTTTTGAG 
109101 TGAGTTTCTT AATCCTGAGT TCTAATTTGA TTGCACTGTG GTCTGAGAGA 
109151 CTGTTTGTTA TGATTTCCTT TCTTTTGCAT TTGCTGAGGA CTGTTTTACT 
109201 TCCAAATTAT GTGGTCAATT TTAGAATAAG* TGTGATGTGG TGCTGAGAAG 
109251 AGTGTATATT CTGTTGATTT GGGGTGGAGA TTTCTATAGA TGTCTATTAG 
109301 GTCTGCTTGG TCCAGAGCTG AGTAAAAGTC CTGAATATCC TGTTAATTTT 
109351 CTCATTGATC TGTCTAATAT TGACAGTGGG GTGTTAAAGT TTCTCGCTAT ' 
109401 TATTGTTTGG GAGTCTAAGT CTCTTTGTAG GTCTCTAAGA ACTTGCTTTA 
109451 TGAATCTGGG TGCTCCTGTA TTGGGTGCAT ATATATTTAG AATAATTTGC 
109501 TCTTCTTGTT GCATTAATCA CTTTACCATT ATGTGATGCC CCTCTTTGTC 
TTTTTGGATC TTTTTTGGTT TAAAGTCTGT TTTATCAGAG ACTAGGATTG 
109601 CAACCCCTGC TTTTTTTTTT TTTTTTTTTT TTTGCTGTCC GTTTGCTTTG 
109651 TAAATCTTCC TCCGTCCCTT GATTTTGAGC CTATGTTTGT CTTTGCATGT 
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109701 GAGATGGGTT TCCTGAATAT AGCACACCGA TGCGTCATGA CTCTTTATCC 
109751 AATTTGCCAG TCTGTGTCTT TTAATTGGGG CATTTAGCCT GTTTACATTT 
109801 AAGGTTAATA TTGTTATATG TGAATTTGAA TCTGTCGTTA TGTTGCTAGC 
109851 TGGTTATTTT GCTCGTTAGT TATGCAGTTT CTTCATAGTG TCAATGGTCT 
109901 TTATAATTTG GTATGTTTTT GTAGTGGCTG GTACAGAGTT TTTCTTTCCA 
109951 TATTTAGTCC TTCCTTCAGG AGCTCTTGTA AGGCAGGCCT GGTGGTGACA 
110001 AAATCAGCAT TTGCTTGTCT ATAAAGGATT TTATTTCTCC TTCACTTATG 
110051 AAGCTTAGTT TGGGAGGGTA TGAAATTCTA GATTGAAAAT TCTTTTCTTT 
110101 AAGAATGTTG AATATTGGCC CCCATTCTCT TTTGGCTTGT AGAGTTTCTG 
110151 CAGAGAGATC TGCTGTTAGT CTGATGGACT TCCCTTTGTG GATAACCTGA 
110201 CCTTTCTCTC TGGCTGCCCT TAACATTTTT TCTTTCATTT CAACCTTGGT 
110251 GAATCTGACA GTTGTGTGTC TTGGGGTTGC TCTTCTCGAG GAOTATCTTT 
110301 GTGGTGTTCT CTATATTTCC TGAATTTGAG TGTTGGCCTG TCTTGCTAGG 
110351 TTGGAGAAGT TCTCCCGGAT AATATCCTGA AGAGTGTTTT CCAATTTGTT 
110401 TCCATTCTCC CCGTCACTTT CAGTTATACC AGTCAAATGT CAGTTTGGTC 
110451 TTTTCACATA GTACCATATT TTTGGAGGCT TTGTTCATTT CTTTTCATTC 
110501 TTTTTTCTCT AATCTTGTCT TCATGTTTTA TTTCGTTAAG TTGATCTTCA 
110551 ATCTCTGATA TCCTTTCTTC TGCTTGATCG ATTTGGCTAT TTATACTTGT 
110601 GTATGCTTCA CGAAGTCCTC CTGCTGTGTT TTTCAGCTCC ATCAGGTCAT 
110651 TTATGTTCTT CTCTAAACTG GTTATTATAG TCAGTAATTC CTCTCACCTT 
110701 TTATCAAGGT TGTTAGCTTC CTTGCATTGG GTTAGAACAT GCTCCTTTAA 
110751 CTCAGAGTTG TTTGTTATTA CCCACCTTCT GAAACCTACT TCTGTCAATT 
110801 TGTCAAACTC CTTCTCTGTC CAGTTTTGTT CCCTTGCTGA CGAGGAGTTG 
110851 TGATCCTTTG GAGGAGAAGA GGTGTTCTAG TTTTTGGAAT TTTCAGCTTT 
110901 TTTATGCTGG TTTTTCCTTA TCTTTGTGGT TTTATCTACC TTTGGTCTTT 
110951 GATGCTGGTG ACCTTTGGAT GGGGTTTTTG TGTGGACGTG CTATTCCTTT 
111001 CTGTTTGTTA GTTTTCCTTC TAACAGTCAG GCCCCTCTTC TGCAGGTCTG 
111051 CTGCAATTTG CTGGAGGTCC ACTTCAGACC CTGTCTGCCT GGGTATCACC 
111101 AGCGGAGGCT GCAGAACAGC AAAGATTGCC GCCTTTTCCT TACTCTGGAA 
111151 GCTTTGTCCC AGAAGGGCAC CTGCTAGATG TCAGCTGAAG CTTTCCTGTA 
111201 TGAGGTGTCT ATTGTCTCCT GTTGGGAGGT GTCTCCCAGT CAGGAGGCAT 
111251 GTGGTTCAGG GGCCTACTTG AGGAGCAGTC TGTCCCTTAG CAGAGCTTGA 
111301 GCGCTGTGCA GGGAGACCTG CTGCCGTCTT CAGGGCTGGC AGGCAGGAAG 
111351 GTTTAAGTCT GCTGAAGCTG CACCCACTGC TGGCCCATCC CCCAGGTGCT 
111401 CTATTCCAGG GAAATGGAAG TTTTATCTAT AAGCCCTTGA CTGGGGCTGC 
111451 TGCCTTTCTT TCAGAGATGC CCTGCCTAAA GAGGAGGAAT CTAGAGAGGC 
111501 AGTCTGGCTA CAGAGGCTTT GCCAAACTGT TGTGGGCTCC ACCCAGTTCG 
111551 AACTTCCCGG TGGCTTTATT TACACTGTGA GGCGAAAATT GCCTATTCAA 
111601 GCCTCAGTTA AGGCAGACAC CGCTTCCCCA CCAAGCTCGA GCGTCCCAGG 
111651 TTGACTTCAG ACTGCTGTGC TGGCAATCCC TATTTTGGTA GGGATTGCAG 
111701 TAGATCTTAG CTTGCTGGGC TCTATGGGGG TGGGATCCAC TGAGCTAGAC 
111751 TTGGCTCCCT GGCTTCAGCC CCCTTTCCAG GGAGTGAATG GTTCTGTCTC 
111801 ACTGGCATTC CAGGTGCCAC TGGGGTATAA AAAAGAAAAC TCCTGCAGCT 
111851 AGCTCAGTAT CTGCCCAAAT GGCCTCCCAG TTTTGTGCCT GAAACCCAGG 
111901 GCCCTGGTGG TGTAGGCACG CAAGGGAATC TCCTGGTCTG GGGGTTGTGA 
111951 AGACTGTGGA AAAAGCGCGA TATCTGGGCT GGAAATGCAG TATTCTTCAT 
112001 GACACAGTCC CTCATGGCTT CCCTTGGCGA GGGGAGGGAG TTCCCGACCC 
112051 CTTGTGCTTC CTGGGTGAGG TGATGCCCCA CCCTGCTTTG GCTTGCCTTC 
112101 CTTGGGCTGC ACCCACTGTC TAACCAGTCC CACTGAGAGG AGCCTGGCAC 
112151 CTCAGCTGGA AATGCAGAAA TCAACCACCT TCTGCACTGA TCTTGCTGGG 
112201 AGCCGCGGAC TGGAGCTGTT CTATTGCCAC TGCTATACCC TTTTATATTT 
112251 CCACCAACAA TATATTAGTG ATCCCTTTTC TCCACATCCT CACTAGCATT 
112301 TGGTGTTCTC ACTGTTTTTT ATTTTAGCCA TTCTGATAGT GATAGAGTGA 
112351 GATCTCATTT TGGTTTTAAT TOGCACTCAG CAAATGGCTA ATGATGTTGA 
112401 ACGTGTTATC ATGTGCTTAT TTTCCATCTG TATATCCTGT GTGTTGAGAT 
112451 GCCTCTTTAT AACTTTAGCC CATTTTCTAA CAGAATTGTT TTCTTTTTTT. 
112501 TTTGTCAAGT GTTGAGAATA CTTTATATAT TCGATTCGTT TGTCAGGTAC 
112551 GTGGTTTGGA AGCATTTTCT CCCACTCTAG TTTTTCTTTT CGTCCTCTGG 
112601 ACAGGATATT TCACAGAGCA AAAGTTTTTT TTTTTTCTTT TTTCCTTTTT 
112651 CTGAAGTCTA ATTTATCAAG CCTTTCTTTT ATGGATTGTC CCTTTGGTGT 
112701 CAAGTCTAAT AACTAGCCCT AGATCCCAAA GATTCTGAAG TTTTTCTTAC 
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112751 ATGTTTTAAA ACAAGTTTTA TAGTTTTGCA TTTTATATTT AAGTTCATGA 
112801 TTCCTTTTGA ATTAGTTTTT ATACAAGATT TGAGGCTTAA GTTGAGGCTC 
112851 TCTCTTTGTT TTCTTTGTTG TTGTTGTTTG GTTTTGTTTT GGATCTCTAA 
112901 TTATTTTAGC AGTTTGTTTA AAAATCTATC TTTCCTCCAT TAAATTGCTT 
112951 TTGGACCATT TTCAAAAATC AGTTGGGCAT ATTTGTGTGG ATCTATTTCT 
113001 GAGCTCTCTA TTCTGTTCTA TCGATATGTG TGTCTATTCC TTCATCAATA 
113051 CCATATGCTA TTGATTACTA GAACTATATA TGGTAACTTG CAATCAAGTA 
113101 GACTATTGCC ATGCTATTTT TTTTGCTTTC AAAATTGTTT TTAAGCTAAT 
113151 CTAGTTTTTT TGCCATTTCA TATAAATTTT GGAATAATCT TGTTTATGTC 
113201 TATAAAAATA TTGGGATTTT GACAAGAATT GTGTTAAGCC TGTAAATCAA 
113251 TTTGTGGAGA ACTGACATCT TTAGTTTGTT GAGGCTTTCA ATCCATAAAC 
113301 ATGTTTCTTC ATTTATTTAG ATCTTTGATT TCTGTCAGTA GTACTGTGAA 
113351 GTTCTTAGCC TACAATTCTT ACACATGTTA GATTTACATC TAAATATAGT 
113401 TATTCCTTTT TCAATGATTA TAAATTGTAT TGTCTTTTAA ATTTTGTTTT 
113451 GCACGTGTTT ATTATTACTG TATAGAAATG* CAATTGACCC TTGTGTGTTG 
113501 ATCTTGTATC TTGTGACTTT GCTGAACTCA TTTATTAATT TCAGGAGTTC 
113551 TTTTGGGGCA CATTTCTTAG GATTTTCTAG GTAAGCAATT ATGTCATCTG 
113601 CAAAAATGCA GTTTTACCTC TTTGTTTCCA ATTTGCATGC ATTTTATTTC 
113651 TTTCCTTGCC TTATTGTTCT GGGCAGAACA TTCAGCACTA CTTTGAATAA 
113701 GACTGGCAAA AGTGTACATC TTTGTCTTGT TCCCAGCCTA AGGGAGAAGG 
113751 AATTCACTCT TTCACCATTA TGCACAACGT TATTTGTACG TTTTTAGAGA 
113801 TGCGCTTTAT CATGTTCAGA AAGTTCTCTT CCATTTGTTT TTTTCTTTTT 
113851 AAAAAATCAG TGGGTGTTGA ATTTTGTCAA ATACTTTTTC TGTATCAATT 
113901 GATCCAATCG TGATTTTTTT TTTCTGTTGG AATGGAGCGT TATACTAACA 
113951 GATTTTCGAA TTATCCATTA TTCCACAAAT ATACAAACTA TCCATGATTC 
114001 CCCCAGTTTT GTTCATTTTG CTTAGGATAG CTGTGGTTAC TTTGGGTCTT 
114051 TTGTGGTTCT ATATAAGTTT TAGGATTTTT TTCCCCATTT CAGTGAGAAA 
114101 TGTCATTGGT ATTTTGATAG GGATTGCATT GAATCTGCAC ATTGCTTCGG 
114151 GTAGTATGGA CACTTTAACA ATATTGATTC TTCCAATTTA TGAACATGGA 
114201 ATATCTTTCT ATTTTTTGTG TGTCCTGTTC AATTTCTTTC ATCATTGTTG 
114251 ATAGTTTTCA CTGTAGAGAT GTTTTACTTC TTTGGTTTAT TCCTAGGTAT 
114301 TTAATTTTAT TTGTGACTAT TGTACTGGGA TTACTTTTTA AAATTTCTTT 
114351 TTCAGATTGT TCATTGTTGG CATATAGAAA TGCTACTAAT TTTTACCATT 
114401 TTTTATTTTT TTGAGACAGA GTCTCACTCT GTCACCCAGG CTAGAGTGCA 
114451 GTGGCATGAT CTAAGCTCAC TGTAACCTCC ATCTCCTAGG TTCATATGAT 
114501 TCTTGTGCCT CAGCCTACTG AGTAGCTGGG ATTACAGACA TATGCCACCA 
114551 CACCCAGTTA ATTTTTGTAT TTTTAGTAGA TATGGGGTTT CACTATGTTG 
114601 ACTAGGCTAG TCTTGAACTC CTGGCCTCAA GTGATCTGCC CACCTCAGCC 
114651 TCCCCAAGTG TTGGGATTAC AGGCGTGGGA TTACCTGGCA CTACTGATTT 
114701 TTGTATGTCG ATTTTGTATC CTGCAACTTT ACTGAATTTA TCAGTTCTAA 
114751 TAGTTTTTTG GTGGAGTCTT TAGGTTTTTT TCCAAATATA AGATCATATT 
114801 ATTTGCAAAC AAGTATAATT TGACTTCTTC CTTTCCAATT CGGATGCCCT 
114851 TCATTTCTTT CTCTTGTCTA ATTGCTCTAG CTAGGACATC CAGTATTACG 
114901 TTGAAGAACA GTGGTTAAAG TGGGCATCTT TGTCATGTTC CAGATCTGAG 
114951 GAGAGGCTTT CAGTTTTTCC TCCTTCAGTA TCATACTAAC TGTGGGTTTG 
115001 TTGTACATGG CTTTTATTAT GTTATGTTCC TTCTATACCC AGTTTTTTTT 
115051 TGAGAGTTTT TATCCTGAAG GGAAGCTGAG CTTTATCAAA TGTTTTTCAA 
115101 CATCAATTGA AATGATCATA TGGTTATTAT TCTTGATTCT GTTTTTATGA 
115151 TATATCATAT TGATTGAGTT GTGTATGTTG AATCATCCTT GCTTCAGAGG 
115201 GATAAGGCCC ACTTGGTCGT GATGAATAAT CTTTTTAATG TGTTGTTGAA 
115251 TTTGGTTTGC TATTATTTTG TTGAGGATTT TTGCATAAAT ATTAATCAGA 
115301 GATATTGGCC TGTAGTTTTC TTTCTTTGGT GTTTCTTTGT CTGGTTTTGG 
115351 TATCAGGGAA ATACTGGCCC TATACAATGA GTTTGGAAAT ATTTCCACCT 
115401 CAGTTTTTTG GAAAAGTTTG AGTAGAATTG GTATTAGTTC TTCTTTAAGT 
115451 CTTAGTAGAA TTCAGTAGTG AAGCCATTAG GTCCCAGGCT TTTCTTTACT 
115501 AGGAGACTCT TTATTATGGC TTCAATCTCA TTGCTTGTTA TTGGTCTGTT 
115551 CAGGTTTTGG ATTTCTTCCT GGTTCAATCT TGGTAGGTTG TATGTGTCTA 
115601 GGAATTTATC CATTTCTTCT ATTTTTTTCC AATTTGTTGG CATATAGTTG 
115651 CTCATAGTAG CCACTAATGA GCCTTTGAAT TTCTGCAGTA TCAGTTGTAA 
115701 TGTCTCTTTT TTCATATCTG GTTTCTTTTA AAATTTGGGT CTTCTCTCAT 
115751 TTTTTCTTGG TCTAAGGGTT TGGGAATTCT GTGTATCTTT CAAAAAAACA 



WO 02/086113 




PCT/GB02/01887 



39/219 



115801 ACTTTTTGAG TCACTGATCT TTTATATTGT TTTCTTTATT TCAAATTCAT 
115851 TTATTTCTTT TCTGATCTTT ATTATTTCTT TTCTTCTAAT TTTGGGTTCA 
115901 GTTTACTCTT GCTTTTCTAA TTCTTTAAGA TGCATTATTA GGTTATTTAT 
115951 TTACTTTTTT TTCTCTTTTT CTTTTTTCTT TTTTTTTTTT TTTTTGAGAT 
116001 GGAGTCTCGC TCTGTCACCC AGGCTGGAGT GTAGTGGCGC GATCTCAGCT 
116051 CACTGCCTCC TGAGTTCAAG CCATTCTCCT GCCTCAGCCT * CCTGAGTAGC 
116101 TGGGACTACA GGCCCCTGCA ACCACTCCCG GCTAATTTTT TTTGTATTTT 
116151 CAGTAGAGAC GGGGTTTCAC CATGTTAGCC AGGATAGTCT TGATCTCCTG 
116201 ACCTCGTGAT CCACCCGCCT CGGCCTCCCA AAGTGCTGGG ATTACAGGCA 
116251 TGAGCCACTG TGCCCGGCCT TTTCTTTTCT TTTCTTTTCT TTTCTTTTCT 
116301 TTTCTGATGC AGGCACTTAT AAATTTTCCC CTTAGTACTG CTTTTGCTAT 
116351 ATCTCATAGG TTTTGGGTTG TTGTGTTTAT ATTATCATTT GTTTCAAAAA 
116401 ATTGTTTCAA TTTCTTTCTT AATTTCTTTA TTGACCCACT GATCATTCAG 
116451 GAGCACACTG TTTAATTTCC ATGTATTTGG ATAGTTTCCA AAATTCTTCT 
116501 TGTTATTGAT TTCTAGTTTT ATTCCATTGT GGTCAGAGAA GATGCTTGAT 
116551 ATTATTTCAA TTTTTTGAAT GTTTTAAGAC TTGCTTTGTG ACCTAGCATA 
116601 TAGTCTATCC TTGAGAAATA TCCGTGTGCT TAGAAGAAAA ATGTGAATTC 
116651 TGCAGTCATT CAATGAAATG TTCTGTAAAT ATCTATTAGG TCCATTTGCT 
116701 CTATAGTGCT AATTAAGTCC AATGTTTCTT TGTTGATTTT CTGTCTGGGA 
116751 GATCTATCCA ATGCTGAAAG TGGCATGTTG AATCTCCAGC TATTATTGTA 
116801 TTGGAGTGTA TTTCTCTTTT TAGCTGTAAT AATATTTGCT TTATATATCC 
116851 GGGTGCTCCA GTGTTGGGTG TATGTATATT TATAATCATT ATATCCTCTT 
116901 GCTGAATTGA CCCCTTGGTC ATTACATAAT GATCTTCTTT GTCTCTTTTT 
116951 ATAGTTTTTG TCTTGAAATC TATTTTGTCT GATATAAGCA TAGCTATGGT 
117001 TTCCACTGGT ATGAAATATC TTTTTCGTTC CCTTTATTTT CAGTTTATGT 
117051 GTATCTTTAT AGGTGAAGTG TGTTTCTCAT AGGCAACAGA TCATTGGGTT 
117101. TTGTTTTCTC ATTTGTTTAG CCAGTCTATG TCTTTTGATT GGAGAGTTTA 
117151 GTCTGTTTAC ATTCAATGTT ATTATTGATA AGTAAGGACT TACTTTTGCC 
117201 ATTTTGTTAC TTGTTTTCTG GTTGTTTTGT GGTGATCTTC TCTTTTTTTA 
117251 ATTCCTTTCT GTCTTTTAGT GAAGGTGATT TTCTCTGGTG ATATAATTTA 
117301 ATTTCTTGCT TTTTATTATT TGTGTATACA TTGCATTTTT TTCAACTGGA 
117351 GATTATCATG AGGCTTGCAA ATACTATCTT ATAACACATT ATTTTAAATT 
117401 GATGACAACT TAACATTAAT TGCATAAATA AACACACAAA AAGAAAACTA 
117451 GTAAAAATTC TACAGTATGA CTTAATCTCC CCCACTTTTT AACTTTTTGT 
117501 TGTTTCTCTA TATGTCTTTT TATATGGTCT ATGTCTTGAA AAGTTGCTGC 
117551 AGTTATTATT TCTGATTGGT TCATCATTTA CTCTTTTTAC TTAAGAGTAG 
117601 TTTACACATC ACAATTAAAA TGTTATACTA TTCTGTTTTT CTGTGTGCTT 
117651 GCTGTTACCA GTAAGGTTTG GACCCTCAGA TGATTTCTTC TTGCTCATTA 
117701 ACATACTTTG CTTTCAGATT GAAGAACTCC CTTTAGCATT TCTTGTATGA 
117751 CAGGTCTGGT ATTGATGAAA TCCCTCAGCT TTTGTTTGTC TGGGAAGATC 
117801 TTTATATCAC TATGCTTAAG GACATTTTAG CCAAATATAC TATTCTGGGT 
117851 TAAAGTATTT TCCCTTTAGC ACTTTAAATA TGCCATGCTG CTCTCTCCTA 
117901 GCCTGTAAGG TTTCCACTGA AAAGTCTGAT TCCAGATGCA TTGGAGCTCC 
117951 ATTGTATGTT ATTTATTTAT TTTTCTATTG CTGCTTTTAG GATCCTTTCT 
118001 TTATCCTTGA CCTTTAGGAG TTTGATTACT GAATGTCTTA AGGTAATCTT 
118051 CTTTCCATTA AATCTGCTTG. GTGTTCTATA ACCTTCTTGT ACTTGAATAT 
118101 TAATATCTTT CTCTAAGTGT TGAAAGTTCT CTGATATTAT TCTTTCGAAT 
118151 ACACTTTCTA CCCTATCTCT TTCTCTATAT TCTTGTTAAG GCTATTAACT 
118201 CTTAGATTTG TCTTTTTGAG GTTATTTTCT AGATTCTGTA AGTGTGCTTT 
118251 ATTTTTTTTT TGTCTTCTCT GTGTATTTTC AAATAATCTG TCTTGAAACT 
118301 CACTAATTCT TTCTTCCTCT CAATCAATTC TGCTATTAAG AGACTCTGAT 
118351 ACATTTTTCA TCAfGTCAAT TGCATTTTTC AGCTCTAGAA TTTCTGCTTA 
118401 ATTCTTTTTA ATTACTTCAG TCTTTTTGTT GAATTTAACT GATACAATTC 
118451 TGAATTCTTT CTCTGTGTTA TCTTTAATTT CTTTGAGTTT CCTCAAAATA 
118501 GTTATTTTGA ATTATCTACC TGAAAGGTCA TATATCTCTG TTTCTCCAGG 
118551 ATTTGTCCCT GGTGACTTAT TTAGTCCATT TGGTGAGATC ATGTTTTCCT 
118601 GGATGGTGTT GATGTTTACA GATATTTGTT AGTATCTGGG CATTAAAGAA 
118651 TTAGGTATTT ATTGTAGTCT TCACAGTTTG GGCTTGTGTA TGCCTGTCCT 
118701 TCTTGGCAAG GCTTTCCAAG TATTTGAATG GTGTGATGGT TAATACTGAG 
118751 TGCCAAATGG ATTGGATTGA AGGGTGCAAA ACGTTGTTCC TGGGTATGTC 
118801 TGTGAGGGTG TTGCCAAAGG AGGTTAACAT TTGAGTCAAC AGACTGGGAG 
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118851 AGGCAGACCC ACCACCCATC CTTAATCTAG GTGGGCACTC TCTCTTTGGC 
118901 TGCCACTACA GCTAGAAAAA GCATGCAGAA GAAAGTGAAA TAAGCAGACT 
118951 TTCTGAGTCT TCCAGCCTTC ATATTTCTCC TGTGCTGGAT GCTTCCTGCC 
119001 CTTGAACATT GGACTTGAAG TTCTTCAGCT TTTGGACTTT TGGACTTGTA 
119051 TCAGTGGTTT GCCAGGGGCT CTCAGGCCTT TGACCATAGA CTGAAGGCTG 
119101 TACTGTTGTC TTCGCTACTT TTGAGGTTTC TGGGACCCGG ACTGGTTTCC 
119151 TTGCTCCTCA GCTTGCAGAT GACCTATCGT GGGACTTCAC CTTGTGAATG 
119201 TGTGAGTCAG TATTCCTTAA TAAACTCCCC TTCATATATA CATCTGTCCT 
119251 ATTAGTTCTG TCCCTCTAGA GAACACTGAG TAATACAGAT TTTGATACCA 
1193 01 GGGTATCAAA ATTCTGTTCT AGAGGAACAG AATTTTAAGG ATGGATTCCT 
119351 TTAGTTGGTT TTGGGGTTTC TGGAGTTGGC TGCTTAATCT GGGACTCTAC 
119401 TTCTAATAGT ATGGAGGACA ACTGATACTA TTTGGCATGA ACTGTTTAGA 
119451 GTTATGTAAA ATAGATGCAT TTGATACTCC CAATTCACAG GCAAGGAGTT 
119501 TAGTGACTCT ATACATAATA CCT^TAACCA TATATGGATA ACCAAGGAAT 
119551 ATAATGAAGT TGGTTGGTTG CTCCTAAGTT TGCTGGACAA AGCGATGAAA 
119601 GAAAATGATG AGCTCAGAGA TTCTAACTCC CAGCTTCAGA ACCACATACT 
119651 GAGCCTCAAA TCTTCTAAGA TTGCCCTGAG TGAGAGTCTT ATCTCCTGTA 
119701 GACAAAGGGC TGAAATTGTG GAAAATCGGA CACAAGCTCT TATCATGTGA 
119751 GAAAGGTGCA TGCACAAACA CACCAGGTGT CTACTGTTAA AGTGAGGATT 
119801 GATTGCAAAA CAATGGGACC TGAAACTTAG AATGGGGACT TGTGGGAGGA 
119851 CGCTGCTGAG GGTGGAGGGA CTGAGCTCCT AAGTTCTGAT GAGCCCTTTT 
119901 GTTGCCAGAG AAAACAGCCT CTCCATCCCC AGTAGTGGCA ACATCACCOT 
119951 CCCCACCCAT GCTGCCATCA GCCTTTCCAC CTTTGTGTGA GGAGATTAAC 
120001 CATCCACTGC CCGGGGCAAC AGTGATGGCC TCCCTTGAGG CAGTTGCCAG 
120051 GCAAGACAAT GCTGATTCTC TTCAGGACCC ACCCGCAATA CCCCTGTTTG 
120101 CTTCTAGACC TAAAACTAGA CTCCAGTCCC AGCAGGCCCC TAGAGATGAG 
120151 GTTCAGAGTG TGACCCATGA CAAGGTGTAA TACATTCCAG AAGAACTGCT 
120201 TGTGTTTTTA AATTTATATA AGCAGAAATC TGGAGAACAG GCATGGAAAT 
120251 AGATATTATG GCTGTGGGCT AGTGGTGGAA AGAACATAAA GTTGAACCAG 
120301 GCTTAATTTA TTGATTTGGG CCCACTAAGC AGGGATTCTG CATTTAATGT 
120351 TGCAGCTCAG GGAGTTAAAA AGGGTTCTAA TAGTTTATTT GCTTGGTTAG 
120401 CTGAAATACG GATCAAAAGA TGGCCCACTG TAAGTGGGTG TAGCTGCCAT 
120451 AATGGACAGC AGAGACAAAG CAGCAAACAC ATAGTCTGAC TTTTGTAGCG 
120501 CTCTGGCATT GGCTAATCAC AGTGTTCCTA GAAGTGAAAT TGATAGGAAG 
120551 CCTACTGCAT TCTTACTTAA TTTGAATAAG CAGAAAACTT CCAAGTAGAG 
120601 TGGACAAAAG ACTAATTTGA ATTATATAAA AAGAGAATCA TGGCCCCTCA 
120651 ATAAATTTCC AGACTTGAAC CAGTTTATAG ACCCAGAGCC CCTTGAATGA 
120701 AGAGGAGGCT GGCTCCCCTT GAGAAAAAAA TCCCACTACA CTACTGACAA 
120751 TTATACTGTT AATTTTTCTC TCATCTTTCC TCAATAAGAC ACTCAGGCTT 
120801 TTACCAGGGT AACTGCACTG GGGAAAGGGA AATGGTCAGA ACCTCGGGGG 
120851 ACTACAGGAC ACTGGGTCTG AGCTGATGTT GATTCCAGGG GACCCTAAAC 
120901 GTCATTGTGG TCCTCCAGTT AAAGTAGGGG CTTATGGAGG TCAGGTAATT 
120951 AGTGGAGTTT TAGCTCAGGT CTAATTTATA . GTGGGTCCAG TGGGTCCCTG 
121001 GATTCATCCT GTAGTCATTT CCCTAGTGGC GGAATGCATA ATTGGCATAG 
121051 ACATACTTAG CAGCTGGCAG AATCCCCACA CTGGCTCTCT CACTAGTAGG 
121101 GTGAGGGCTG TTATGGTGCA AAAGGCCAAA TGGAAGCCAT TAGAGCTGCC 
121151 TCTACCCAGA AAAATAGTAA ATCAAAACCA ATATTGCATC TCTTGAGGGA 
121201 TTGCAGTACC ATCATTACTT AAAAGATGCA GGGGTGGTGA TTCTCACCAC 
121251 ATCCCCAATA AACTCTCCTA CTTGGCTTGT GAAGAAGACA GACAGATCTT 
121301 TGAGAATGAC AGTGGATTAT TGTAAGCTTA ACCAAGTGGT GACTCTAATT 
121351 ACAGCTGCTG TACCAGATGT GGTTTCATTG CTTGAGCAAA TTAACATATC 
121401 TCCTGGTAAC TGGTATGCAG CCACTGACTT GGAAAATGCC TTTTTCTCCA 
121451 TTCCTGTCCA TAAGGCCCAC CAGAATTAAT TTGCCTTCAG CTTGCAAGGC 
121501 CAGCAATGTA CCTTCACTGT GCTACCTCAG GGGTATATCA ACTCTCCAGC 
121551 TTTGTGTCAT AATCTTGTTT GCAGAGATCT TGATCACTTT TCCCTTCCAC 
121601 AAGATATCAC ACTGGTTCAT TGCATGGATA TTAACCTGAT TGGACCCAGT 
121651 GAGCAAGAAT TAGTAAATAC ACTGGACTTA CTGGTGAGAT ATTTGCATGC 
121701 TAAAGGATAG AAAATTAAAC CCGACTAAAA TTCAGGGATC TTCTACCTCA 
121751 TTGAAATTTC TAGGGATCCA TTGGTGTGAG GCCTGTCAAG ATATTTGTTC 
121801 TAAGATGGAG GATAAGTTGC TGCATTTGGC CCCTTATGCA ACCAAGAAGG 
121851 AGACACAATG CCTAGTGGGC CTATTTGGAT TTTAGAGGCA ACATATTCCT 
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121901 TATTTGGATG TGTTATTCCC ACCATTTATC CAGTGACCTG AAAGGCTTCC 
121951 AGCTTTGAGT GGGTTCAAGA ACAAGAGAAG GGTCTGCAAT AGGTCCAGGC 
122001 TGATGTGCAA GCTGCTCTTT CACTTGGGCC ATGTGACCCA GCAGATCCAA 
122051 TCGTGCTTGA AGTGTCAGTG GCAGACAGGG ATGCTGTTTG GAGCCTTTGG 
122101 CAGGTCCATA TAGGTGAATC ACAGTGGAGG TCTCTAGAAT TTTGGAGCAA 
122151 GGCCCTCCCA TCTTGTGCAG ATAACTACTC TCTTTTTGAG AGACAGCTCT 
122201 TGGCCTGTTA CTGGGCTTTG GTTAAAACTG AATGTTTGAC TATGGGTCAA 
122251 GTCACCATGC GATCTGAACT GCCTATCATG AACTGGGTGC TTTCTCACCC 
122301 ATCTAGCCAT GAAGTTGGGT ATGCATAGCA GCATTCTACT ATCAAATGGA 
122351 AGTGGTATAT ACATGGTCAG GCTTGAGCAG GCTCTGAAGT CACAAGTAAA 
122401 TTACATGAGG AAGTGGCTGA AATGCCCATG GTCCCCACTC CTGCCACCCT 
122451 GCCTTCTTTC CCCCAGCCTG CACCAATGGC ATCATGGGAA GTTTCCTATG 
122501 ATCAGTTGAC AGAGGAAAAG AAGACTAGGG TCTGGTTTAC AGATGGTTCT 
122551 GCATGATATG CAAGCACCAC CCAAAAGTGG ACAGCTGCAG CACTATAGCC 
122601 CCTTTCTAGG ACATCTCTGA AGGACAGTGG TGAAGAGAAA TCTTCCCGGT 
122651 GGGAAGAACT TTGAGCAGTC CACCTGGTTG TGCACTTTGC ATGGAAGGAG 
122701 AAATGGCCAG AGATGTGATT AAATACTGAT TCATGGGCTG TAGCCACTGG 
122751 TTTGGATGGA TGGCCAGGGA CTTGGAAGAA GAATGATTGG AAAATTTGTG 
122801 ACAAAGAAAT TTGGGGAAGA GGTATATGGA TGGACCACTT TCAGTGATCA 
122851 CAAACTCTGA AGATATTTGT ATATCATGTG AATACTCACC AAAGGGTGAC 
122901 CTCAGCAGAG GAGAATTTTA ATGATTAAGT GGATAAGATG ACTTGTTCTA 
122951 TGGACACCAC TCAGTCCCTT TTGCCAACCA CCCCTGTCAT TGCAGTGGGC 
123001 CCCTGAACAA AGTGGTCTTG GCAGCAGGGG TTATGGAGGT TATGCATGGG 
123051 CTCAGCAACA TGGACTTTCA CTTACCAAGC CTGACCTGGC TAAGGCCACC 
123101 ACTGAGTGGC CAATTTGCAA GTAGCAGAGA CCAACACTGA GCCCTCAATA 
123151 TGACACAATT CTTTAGGCTG ATCAGCCAGC TACTTGGTGG CAGGTTAATT 
123201 TTATTGGACC TCTTTCATCA TGTAAAGGGC AGCAGTTTGT CCTCACCAGA 
123251 ATAGACACTT ACTCTGCATA TCGGTTTGCC TATATTGCAC TCAATGCTTC 
123301 TGCCAAGACT ACCATCTGTG GACTCATGGA ATGCCTTATC CACCATCATG 
123351 GTATTCCACA CAGCATTGCT TCTGACCAAG GCACTCACTT TGCAGCTAAA 
123401 GAAGCATGGC AGTGAGTTCA TATTCGTGGA ATTCACTGGT CTTCCCATGT 
123451 TCCCTATCAT CCTGAAGCAG CTAGATTGAT AGAACGGTGG AATGGCCTTT 
123501 TGAAGTCACA ATTACAATGC CAACTAGGTT ACAATATTTT GCAGGGCTGG 
123551 GGCAAAGTTC TCCAGAAGGC TGTGTATTCC CTGATTCAGC ATCCAATATA 
123601 TGGTACCATT TTTTCCCACA GCCTGAATTC ACAGGTCCAG AAATCAAAGG 
123651 GTGGAAGTGG AAGTGGCACC ATTCACCGTC ACCCCTAGTG ACCCACTAGC 
123701 AACATTTTTG CTGCCTTTTC CCGCGACATT ACATTCTGCT GGCCTAGAGG 
123751 TCTTAGTTCC AGAAGGAAGA ATGCTGCCAC CCAGAGACAT AACAATGATT 
123801 CCATTAAACT CAAAGTTAAG ATTGCCACCT GGCCACTTTG GGCTTCTTCT 
123851 ACCTCTAAGT CAACAGGCTA AGAAGGGAAT TACAGTGTTG GCTTGAGTGA 
123901 TTGACCCAGA CTATCAAGAT GAAATCAAGC TACTACTTCA CAATGGAAAT 
123951 AAGCAGTAGT ATTTGTGGAA TACAGGAGAT CCCTTAGGGT GTCTCTTAGT 
124001 AGTACCATGC CCTGTGATTA AGGTCAATGG GAAACTATAA CAACACAATC 
124051 CAGGCAGGAC TACAAGTGGC TCAGACCCTT CAGGAATGAA GGTTTGGGTC 
124101 ACTCCACCAA GTAAAAAACC ATGACCCACT GAGGTGCCTG CTGAAGGTAA 
124151 AGGGAATGCA GAATGGGTGG TAGAAGAAGG TAATCATCAA TACCAGCTGT 
124201 GACCACGTGA CCAGTTGCAG AAACAAGAAC TGTAATTGTC ATGAGTATTT 
124251 CCCTCTCATT TTGTTAAGAA CATATTTGTG CACGTATACA CTTGTACTAA 
124301 GAAAATATCT TCATTTTATT TCCTTTCTTT TTCTTTTATT ATGTGACATA 
124351 GATTGATTGA CTTTATATAA GCATTTAAGT GTTGTTAACT TTATGTAATA 
124401 GCATTTAGGT TAAGGATTAG TGCATTTCCA GTTGTATGAA GGATAGCTGT 
124451 ATTATCTTAG GTATAATTAT GACATTATTG TCTTTATTTG AAGATTATGT 
124501 ATTATTTCAG GAGATGTGCA AAAGCACTCT TATAGCCTCT CCAGCTGGTG 
124551 TTTTGCTAGG TCATGTGTGA CCTCAGTTCA CTGGCTCTAA GCCCAGCCTT 
124601 TCACTAGGAG TTTCCTAGGA ATTGCAGTCC TTGTGTCTTA GACTGCTTTT 
124651 CAAGTTTACC TAGGACCCCA AAGTACTTCA GCCCATGCTG GCAAGGTTTG 
124701 CTGAGAAACT CAAATTCTGA CTGCTGGAAT GGGGCAATTC CCCTCTGGCT 
124751 AGGGCTGGTC CAACTTCTTC CATGCACAGG CACTGGCTGA TCCCAGTGTG 
124801 GCTTTATTCT CTACTGTGAC AGGACATTAC TGAGTTCAAT GTAAAGTCCC 
124851 CCAGTCGCTG AGCTCTCCTT CCCCAAAGGG CACAGATTGT CTTTCTGCAC 
124901 TGCACAGTCA TTGTCCAGAG GTGGGAGAAA GGGCATTGGC AATTCAAGAC 
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124951 TGTCTATCTT GCCCTCCTCA ATGCCTCTTT TGACAATATG AAGTTAAAAC 
125001 CAGGTACTGT GATTGCTCAC CTGATTTTGG TTCTTGTGAT GTTGCTTTTC 
125051 TGTGTGGAGG TAGTTGTTAA AATTTGGTGT TTTAGCAGGG GAGAACAATG 
125101 GTGTAGGCTT CTGTTTTGTC ATCTTGCTCC ATCCTCTCTC CTTGGGTATT 
125151 TTACATTACA ATTTAATTTC CATAGAAATT ATAGGGCTGT TCAGACGAGA 
125201 TCGGGCACAT TCCGGGTGGT ATGGCCGTAG GCTATAGGCC TATTCAAATG 
125251 GTCTATTTCA TATTGGTGAG TTGCGATAGT TTTTGTTTTT CAAGGAATTG 
1253 01 ATCCATTTTG TCTAGCTGTC AAATTTATTT GTCTAGAGTT GATCACAGTA 
125351 TTTCCCTTTT GATGTCTGCA GGGTCTGCCG TGATATCTCC TGTTTCATTC 
125401 CTGCATCGTT AGATATTCTG GCATCTCTGG CATCTTGATG TTATCACGTA 
12 5451 TGAATTTTTT TTTGTCGTTT AATTTCAGAT ATTCCTGGTT CTTGCTGTGA 
125501 TGAGTAGTTT TTGATGGAAA CTTTTCAATC ATCAAACAAT ACATTTTTAT 
125551 ATTTTGTTAT AAGGTTCTGG ATTTTATTTG AGCTTTTGGT TTTAGCTGGC 
125601 TTTATTTTAC TACACTCTGT TAGAGGAAGG GACACTGCCT CATTACTGCC 
125651 AGGTGGAGGG ACAAGTTCAG ACTCTCCAAA AGGTCTGCAT GAACACCATG 
125701 GCAGGGGTGG CTTTTATTCT GGCTGATAGG GTAAACGTCC TTGCTCCTGC 
125751 CTTGCCTTCC TTGACATACC TTTGTTATAT CCTTCTGTGT GTGGAAGCCT 
125801 AGACTCCCCA TTTGGCCTTT GCTGGTGGAT GTGGAGGTGA GACCACAACT 
125851 TTTTTTGTAG TGTTTGGCTG GAATAGAGTG GTTATTGTCT AAAATTTTAC 
125901 TATCTGGCTA AGCTACTCTT TTCCTTCTTA TTTGGCTAGA GGATTTTGAA 
125951 TCTTGGAGCT TTTGGTTGTT TTGTCTATGA GTGTTAATAT TTCTGGGTTA 
126001 CTGACATTTT CATC TCC AAA TTTCAGTTAT TAAAAAAAAA GAAAAGAAAG 
126051 GAAAACCCAG GAGACTTTCC ACTGTAATGT TCCTTGGATT TTGAGGTCCT 
126101 TAGGTGGTCT GTCTTATTTT CTCCATCCTT CAGTGTCTTC TTATGCTTGT 
12 6151 TTTATATAAA ATATCCAGAG ATTTTTGTTG CACTTAGTGA GAAGACTAAG 
126201 GAAAAATATA TGTATTTTAT CTTTCTGGAA GTGTATGTAC TCTATGGCAA 
12 6251 TATACATGAA GTGTTTATCA AATAGGAGCT AAACAAAAAT GTGGCAGAAT 
126301 CATTTAATGA AATATTAACT TGTATTTCTT TTTACTGTAT TTCCTTTTTT 
126351 CCCCTCATTT CTATCGGAAG CTTTTCTTGT CAGACCACTA AGGACCCCAC 
126401 AGCCTTTATA TTTATGAACT GTAGTGTAAA ATTCAGGATG GGATCCAGAA 
126451 AGGTGTTTGT CTGGAAGCTG GAAGATTAAC AACAAAGCAC CAGACCAAAT 
126501 TATTAGAATG GGGCTTGATT TGCAGCGTCA CATTTTTAAA ATCACTAGAT 
126551 TTATTTGGTT AATGGCAGTT GGGGAAAACT GACATAGATT TGACTACTCA 
126601 TTTTTAAAAA AATTACCACC AAGTTTTCCC CATTCTCCTG TTTTCATGGT 
126651 TTCTCTTGCC AGCTTTTCTA AAAATTTAGA/ AATGCTTTTC TGCCTTTTTT 
126701 TAAATCTTAT ATGAATATCA CCTTTGCTGC TGCTAATGAG AAGAGATCCT 
126751 TTTCTGCTCC AGTATTGTAA TTTAAAATTA ATTGAGTAAA CTGATTAAAG 
126801 TTATAAAATT ATTAGAAACG TATGAGATAA AGGACAAAAT GTACAGGTTG 
126851 AGTAATAGTG TTTTTTTTTA ATGGCTTGTC TTTACTGAAG AGTTTTTTTC 
126901 AGTAGAACAG ATACAATTAT GCAGGGCATT ACTTATGCAA AAATCATATA 
126951 AGCAGCATTT ATTTTTGTGT GTATCTATTC TAATTCAGTA TCTGCAATTT 
127001 AAAAATAATA TTATTTCTAT TTAAATTATA GATACTGTAT ATGTTCATTG 
127051 TGAAGGATAT AGAAAACACA CAAAGGTACA AAGAGAAAAT GTGAAAAAAG 
127101 TCAAATAATT TTACCAACAG AAAACCAATG TTAACATTAA TGAAATTCCT 
127151 TTCTATTTTT AAATCTCTGT GTATATATGT GTGTACACAT GCATATTTTG 

1272 01 TAGTCTAATT TCTGCTAAAT TGGTGTGTGA ATACATACAC CGCACAGACA 
127251 TACACACTTA ATATTGTATT GTGAGCATTT TCTTAGGTCT ATAAACTTTT 

1273 01 AAAAAATAAT TGTATTATAG GGAATATACC ATCTTTTTGT TCAACCATTT 
127351 TCTTAATATA AATAACAATT TAATACATAT AATCTCTGTT AAAGTTTTTG 
127401 ATTATTTGTC AAGGATAGAT TTTTTTTAAG GATATCAATA TCTTGGAACA 
127451 TAGAAACTTT TTAAACCTTT AGTGCAGCAT TACAACATTT ATTAGCAAAA 
127501 AGTTGTGCCA GTTTGCAAAC TCGTGAAGAT TTCAAGGCAC CCGTTTACTA 
127551 TACCATCACT AACAATGAGT AGTATTTTTT TCAATCTTTG CTTTAAAATT 
127601 GTATGCTAAT ATAGGTAAAA GATGATACCT TTCTTTTTAA TTTGCCTTAG 
127651 AAATCATTAT TGAGATAAAA CATTTGGCCA TTTGCATTTC TACTTTTGTG 
127701 ACTTATGATT CACATCATGA TTCTATTAGA GTTTTAATTT TTTATCTTAC 
127751 TCATTTTAAA GAAATCTGTA TGTATTGAGG ACATTAAACT TATGTCATAT 
127801 TTGTTACAAA TGTACTGTTT ATTACTTGGC TTCTAAGGTT GTTATAAGAT 
127851 TTTGACATTC AAAAGTTTTA AAAATTTTTT AGTGAAATTA CCTTTGTTAC 
127901 TTCATCCATT TCTACTATGA TTAGAAACTG CTTAGATCTG ATAAATATCA 
127951 TCTGCCTTTC TTCTATATGT AAGTGGTTGT GTTGTTTATA TTGAAATCCA 
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128001 TATCTTTCTG GAGTTCATTC TTGATGTAGG GTGTGAGATA AGATCTAGTT 
128051 TAATTTTTCC TATTAGTAAG AAATTGCTGT TAATAATATT TCCCTTTCAT 
128101 GTTTATTGAA ATGTCAGTTT AAACTTATTT AATTTAGACA TATATTAGTT 
128151 GCTGTTTCTT AATTTTTATT CTTTCTGTCC ATCTATGTTT ATAGTAGAAC 
128201 AACTCCAATT GTAACATTTT AGTATTTTTT ATTATACTAA TAGATGGTAC 
128251 TAAAAATTGT TTTAAAATTG TATTAGCCTT TCTTGGATGT TAATTCTTTC 
128301 AGATGAACTT TAAAAAATAC TTTATTAATT CTTTGTCTCC CCCTTATAAA 
128351 ACAAAACAAA ATGAAACAAA AATTAAAGCA AAGCAAGAAA ACAAACCAAA 
128401 TCCCCTGGGG CTTATTTAAT ATAGTTGCAC CAATCATATA AATTGTTTGG 
128451 AGGAATTAAT TGACATTGTG ATAATACTCA GTCTTCATTT CCAAGATGAT 
128501 GACATCAATC TCTGTTTACT CAAGTCTTCC TTCATCTTTC AGCAATGTTT 
128551 TAACTCTACA GGCAAAAGGA GAGGTTTGGG TGAGCAGAGG TGGCAGATGA 
128601 CTTCATGCAG AGGCCTGCAG GTTACTCTCA GGACCTCTCC CACAGTCTAG 
128651 CCCTCAGCCT TGAAATGATA GGCACTTGGT TTTGTTAATA TGCCAAGAAT 
128701 GGGTCTGTCA CTCAGGAAGA AATTTAAATA AAACATTCTC AGAAATGTGA 
128751 TGGATAAAAT CAGAATGTCA GTAGATTGAT TTACAAAGAT AAATAAATGT 
128801 AGAGGAAATA GGCCCCTCAT GCATTGTAAC CAGAACATTG CCTCCCTGTT 
128851 GGTTCCCATT CAGGGGAAGA AACACAGGAG CAGAAGATGG CTTTGTGAGC 
128901 CTGGAGTCAG AACAAATGAT TTGAATTATG TAATTTAATT ACTGAACAAA 
128951 ATCAGGTTAC ACCAGTCTAA GTCCATCCTT TCTATATATG TACACCTGCA 
129001 AGGTGGTAAG GAAAGCAGTT TTTTTTTGTT GTTGTTGTCT TTGTTTTTCT 
129051 ATATGGACTA TCTTTCTGTG TTTTGTTTTC CTTTACACAT TTGGGAAATA 
129101 TCATCTTAAT AAATACTATG CTTTAACATT TTTTAATTTA AAAATATACT 
129151 TAAGCTTCCC TCCTTTCTGC CACCTCCATT AAAATTATAG TAAAAAACAA 
129201 AACAGGCATA GAGCCAGAAG AGCAAAGAAA GCAGCAAAGA GGTCCATCCA 
129251 CTTATGAGAG AACTGAATAA ATAGCCAGAA GACGGAAAAC AAGGGAAATA 
129301 AATGATGACA AAGTAGAGAA AGAGACAGAC CAACTACACG AAGAAGTTTG 
129351 CTTCCAAGGT AAGCATCTTT CTGCCTGGCA GAATTCCAAG AAAGCACCAT 
129401 GTTTTGAGAC TTGGGAGGTA AAGAGAAGTG AGGTGTGTTG ACAACAGTGG 
129451 GTTTGATGGC CAATCTATAG GATTTTAAAA TTGGCATCCT TCATTCACAA 
129501 CTGGTGTTCA GAGTGAAGGC AAAACTTCCA TTTCCCTAGG AAATCCTTCT 
129551 CCAGTGACCA GAGGAATGAC CAGCTCCCCA CCTTCTCACT GTCCAGCAGT 
129601 CCCTGCTAAC CCACAGGTAT GTGCTGATGC CAGTTCACCT ACTTCCTCAC 
129651 CTGTGAGCAC AGGTCACAAA GAGGGAAGAT GTTGAGGAAG ATTTGAAGCC 
129701 TGAAGGACAG AGGGGGACAG CACATTAGGA ATTCTAGTGA GTTAGTTTAA 
129751 ATTCCTCACT TTCTCAGAGT GAAGTTAACA AACAATACCA AATATTTGAA ' ' 
129801 AAAAAGCAAG CAGTAAAAGT GTAAGCATTA TAATTCAAAA TACATAAGTT 
129851 GAAAACAAGG AAAACAACAT AAATGATGAA AAATAATTGC TCCTTCTCCT 
129901 TGATATTGAG ATCAGAGAGT GACAATAGGG TGATAAGACA TTGCTTTAAA 
129951 AAAATTCCTT ATGTACTTCT TGATTTAAAA AAATGTAAGT ATTATTTGGG 
130001 ATTAAAAACA AAGGAAAACA TTTAACATGC AGAATGAATT TATAAACAAA 
130051 AGAAACTATG TTAAACCATT TTCATTTATA TTGTTTTATG TTTTCAATTT 
130101 TAAAAATGAA CCACCTTTTG TATAAGGAGC ACCCAACAAA ATATCTGGCA 
130151 CTTAGTAGTT GCTCAGAAAG TACTTGTTGA ATAAATAAAC TTATTTGCAT 
130201 TTTTAAACTT TTTTCCTTCA TTCGCCTCAT TCTTAGTTTT CGTTTAAATA 
130251 ATTTAGAATT CTAGATTTAC ATTATTATTG TTATTATTAT TCTGTTACAC 
130301 ATACATCTTA TAACTCATAT TAGTGACCGT AGCCCATCCT TTTAAAACCA 
130351 CAATAAAAAA CACAACTTCC TTATTCTTGA GTGTTTAATT TCAAATAACC 
130401 TTTTTACCGG CTGGTCTACA TCTTTCAGTA ATTTTCTCAG* AAAGAATATT 
130451 TGGGAATTAG ACTTTTTAAG CCCTTAATAT CTGAAAATAA CAATCTGTGG 
130501 ACTTCCCACA CAAGTGATGA TTTGAATGAA TATAGAAATC TCTAGTTAAC 
130551 TGCTTTTGAT CTAAAATTCT ACATTGTCCT TTTTTTGTTG TTTCTTCTTC 
130601 CATGTTGTGA AAGGGTAGTC TGATGAATTT GGCCAGAATT TGGTCCCTTT 
130651 ATGGTTTATC TGTTGATCCT TGGCTACCTG TAGAATCCTT TCCTGGATCT 
130701 GAAATTTAAA ACTTTTCAGG AAGTCTATTT TCATTAGATT TTCTAATTTT 
130751 TTTGTGAACA CTTTTTGATT TGTCGAAGTG AGGTAGAACA AAACCGAATA 
130801 TCACCAGCTG TTCATCTTCT TAACTTGGAA GTTCCTTCTT TTTTTACTCT 
130851 AGAGAGAGAG TGTATTTTTG AGTAAGTAAA AGTAGTTTTC GAATGCATTT 
130901 GGGGTAAAGC ATCATCAAAT GAAGGATATA AACTCCATAG ATACATCACT 
130951 ATGACTTACT AACAGGACAA CTT^TCCTTT AGTTAACTAT CCGTGGGAGA 
131001 AACCAATATG TGAAGAATAG CTATAATTTC TTGTTCCATG GAGTTTTTTC 
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131051 AAGTTACCCC AAAGAATTTG ATAACATGTC ATCAAGTGAA" AGATATAAAG 
131101 AACTCATGTG TAGTTTCACA GGACTGTGCT TAGGATACTC TATTTTTAGT 
131151 TTGAGTACCC CTAGTGTTCT TGGTACTTTT CAGTTATCTT GCTGCCTCTC 
131201 ACTGAGTCAA AGATAAATTC AACAAAGGCA GAGACAATAC CCACTGATTG 
131251 TCTGCATCCC ACATAGTGGG ATACATAGAA AGTGCTCAGT TAGCTTTGAT 
131301 TGACAGATGT GCTAATCATC CACTTGGTTG GTAGTCTTAC AGCTGTCCCT 
131351 GCTTGAACCC AAGCTATATT TACCTCTAAT TCAGACATGT TTTGATTGTT 
131401 ATGTGGGTAG ATATGATAAG CCTTGCAGGC CTTTTTACAT GAAGTACCTA 
131451 GGGCTTTTCT GCTCAGGGTG TGGTTTGGGA ACCAGGAGTA TTAAGGGTAT 
131501 CTGAGAACCT GTTAATGATG TAGAATCTCA ATTCCCCTCA TGTTTACTGG 
131551 TTTACATACA TACATTTTGA GAAACACTGC ATTTGGAGAT TCCTGCACTA 
131601 GATACAACCC AAAGAAATAA CTGAGGGATA TACAGGATAT GGTCAGAGGG 
131651 AACTCTGGAC ACCCTCTGGC TGTTCTCATG GGTTGAATAT GATGAGCTTA 
131701 ATAAAACAAC TAGCGGCTAT TAAAATCTGT GTGGGCTTTG AGTTAGCAGT 
131751 CTCCTGCAGC TTTGGTGCAT GCAGCTTTGG CAACTACAGG AGATAGCTGC 
131801 GGAACTCTTG GCAGTGCTGC AGTGCCCTTA ATGCTGGCAT TGGTGGCCTT 
131851 GATGGCATCT GCACTGGGTA TCATCTGCAT AGCTGCCACT GGGCTTGCAG 
131901 CAAGAGGCTT CAACCCAGCA AGCTGGAGCA GGAGAGAGGA AATGTGCTGT 
131951 CAGCCTCTTA TTGCCAAGAG GGCTAGTTTT TGGAACACAT CTCACTTCAT 
132001 ACCAGATGAA ATGGAGCCAT CAGATAGCTG GCCTGAACTT ACAAAATTAC 
132051 TTTCACGGCA TCCATTCTTT CCTTCTTCTC TTCTGTTACA TTTAGGTTCT 
132101 GCTTCTATGA CTTCATGCTT TCTCAAGGAT CACGTTATCA TTTGTCTTTT 
132151 TGATCTCCTA TATCTTTAAT GTCTCTTTTA ATATGGACAT CTTCTAATAA 
132201 ATCCTTAAAT ATGATCAACT ATTTCTCTCT TAAAAAAGTA GCACCTCTGA 
132251 TGCCACCTTC TGCACCAGCT ATGCTATCTT TAACCATCTC TTTAAAGATA 
132301 AACTTCTTAA AAAGTTGTTT ACAGTTATTG CTTTCATTTC TTTATCTTCC 
132351 ATTCATTTTA TTGTCACAGA TTTTTATGAC AACTGTCACT GATTTTTTAA 
132401 AGCTGTGAAA TATAAATGCA GAACGATTAC AGAAAATACC TGTGTATAGT 
132451 TTAACAAATA GTTAAAAAGC AATGCAAGAG TAAACACCTC CCTTCTCGAG 
132501 AAATAGAATA TTACAACCAC TCCAGAAGCC TTCATGCATC TTCCAATGTT 
132551 ATCATCTCCT TCCTCTCTAG GTATAAACAT TTTTATCTTT TGTAAATCCT 
132601 TTCTTTATAG TTTTATCCTC TTCATATGTA TGACTAAACA ATATAATTTT 
132651 TCCTGTTTTT CAGTTTTATA GAATATAATC ACTCTGTGTG TATTCATTTC 
132701 TATATCTATA TCTCTCTATT CAAAATCTTC TTTTACTCAA TGTTATGTTT 
132751 ATAAGATTTA TCCATAATGT TGGGTGCAGC TGTAGTTCAT TTTCATQGCT 
132801 GTATGGTATT CTGTTGCATG AATACATTGC AATTTACTTA TCCACTCTAC 
132851 TCTAAATAGA CATTCGGGCT GTTTCCAATT TAGAGCTATT ATGAATAATG 
132901 TTTTTACTTA CATGTTTATT CGAGTATCCT GGTACACGTA TGTAAGCATT 
132951 TCTGTAGGGT ATATTCCTGG GAATAAAATT CCTGGGTTAT AGGTCAGGTA 
133 001 TCTTTAATTT ACTAGATAAT GCCAAACCTC TTTCATTCCA TCAACCATGT 
133051 TTAGTGGGCT CCTTCTCTTG ACATCCTCAA CAATATTTGA TAGTGCCAGA 
133101 CATTTTACCC TTTGCCTATT AAGTGTGATG TGCAATGGTA TTGAGTGTGT 
133151 CATGGCATTT TAATATAGTT TTAGTTTTTA TTTTCCTGAG TATTAATGTG 
133201 TTTGGCCACG TTTTCACATA TATATATATA GACTATTTAG AATTC ATTC A 
133251 TGTAAGTTGC TATTAAACTC ATTTACCTGT TTTAAAAAAT GGATGGTCAA 
133301 TTTCTTATTA ATTCATAGAT ATTTTTTATA TATCTTGGAT ACTTGATTTT 
133351 CTAGTTATTT TTTCTGCAAA CACCTTTTGC CAATCTGAAC TTGTCTTTTT 
133401 ATTATCTTTG TGATGACTTG ATAAATATGA GGTCCTAACA TTATATATAT 
133451 ATATATATAT ATATATATAT ATGTTTTGTT TTGTTTTGTT TTGTTTTGTT 
133501 TTGTTTTGAG AGAGGGTCCG GATCTGTTGC CTAGGCTAAA GTACAGTGGT 
133551 GTAATCACAG TTCACTGCGG CCTCAACCTC CCGGGCTCAA GTGATTCTCC 
133601 TGCCTCAGCC TCCTGAGTAG CTGGGTGGTG CATGCCACCA GGCCTGAATA 
133 651 ATTTTGATTT TTATTTTGTA GAGACAGTAT CTCACTATGT TGCCCAGACT 
133701 GGTCTCTAAC CCCTAGGCTC AAGCAATTTT .CCCACCTTGG CCTCCCAAAG 
133751 TGCTGGGATT ATAGGCATGA GCCACCATGC CTGGCTGAGT TCTTAATTTT 
133801 AATGGAGCTC AAATTTTAAT GGACTTAAAA ATATTAGTAA TTTTCTTAAT 
133851 TTATAGCACT TTTTGTGTCT TATTTAATAA ACTATGTTTT TAATCAGAAC 
133 901 TTCTGAGGAA TTGTCTTGTA TAATCTTTAT AATTTATCTT TCACATTTAG 
133 951 GTCTTTAATT TACCTGGACC TGATGGTTGC GTATAAAGTG AGGAAACGGT 
134001 CCAATTTTTT GTCTTTTTGT ATGAATACTC AACAGTCTAA AAACTATTCA 
134051 TTGTAATATC TGTCATTTTT CCACTTGTCT GCAATGCCAC CTCTGTCATC 
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134101 TATCACGTGT TCATTATAGG TAAATTTGTT TTTCTGTTCT TTATTCCATC 
134151 TGTCTACCCA CATTTTGTTA CTATCGTTTA AAAATAAGCC TAACGTACAG 
134201 TAAAGGGCGC TCTCCCAGCC TGTTTTTCTT CTTTGAGAAT GTCTTGGCTA 
134251 TCTTTGGCCC ATTTGCATTT CCATGTAAAT TTTAGAATAT GCTTTTCAAG 
134301 TTTTGTACAA ATAACCATAG TAGGATATTG TATGGCATTG CATTAAATCT 
134351 ATAAATCAGT TTTAATATAA TGGACATTTT TAGTGGTGTT TTAATCCATA 
134401 AACATAATAT ACCTCTCCAT TTATTTAATA TAGCTCAATA AAGTTTATAT 
134451 CAAAATATAC AATATTTGTT AAAGCATTGC ACAGATATTT TTGAATTTGC 
134501 TTCTAGATTT GATACTACTA TGAATGCCAT CATCTTCAAA ATTTTATTTT 
134551 CCAATAGTTT GCTCTGATAT ATTGAATACT AGTTGTTTCC ACAAATTTAT 
134601 TTTGTATCTG GCAATGTTGT TAAACCCACA TATCATTTTA AATAATTTAT 
134651 CTGTAGACTA TTTTTGAACT TTATACATAC ACAATCATGC CATCTGTGTA 
134701 TAGATACAGT TTTCATTTTT CTTTTCAAAT TCTTATAAAC TCGATTTCTT 
134751 TTTATCTTAC TGCATTGGTT TAGATCTCTG ATACCATGCC AAATTAAATG 
134801 GTGACAGTGG AAATCCTTGT CTTATTTCTG GTGATGAAGG TAAAGATTTT 
134851 AAATATGACA TTTGATCAAA TATTTTTGTA TATACTTTTA TTTGGATTAA 
134901 AGAAATTCCT TCCAATACTG GTTTCTAAAG CGTTTTTTTT TTCAATAAAT 
134951 GTGTGTTGAA TTTTAGTGAT CCTTTTTTTT CTGAAATCAC TGAGATTATT 
135001 ACTCTGATCT GTTAAGGAAG TGTATTGATT AATTTTCTAA CATTAAAAAT 
135051 GAAAACTTAC CTTTTAAACC TGAATTATTT TCATAGTGAT GTGTTTGCTT 
135101 TGTTAGCATT TTAAGATGTT TGCATGCATA TAAATAAGTA AGAGATCTGT 
135151 AACTTTTCTT ATTATGCTGT CCTTGTTAGG CTTTTATTTC AAATCTATGC 
135201 TGTTCCATAA ATGCACTGAG AGCTATTCCC TTATTTCCTG TGATTTTTAA 
135251 CTCCTCTTCT ATACTTACTA CATAACGTTT ACATTATTGA AAATATGCTT 
135301 AGTAGAATTC CTTGATAAAG TAATTAGGGC ATTGTGATTT CTTTGTGGAA 
135351 AATTTTAAAA ATTACTGATT CATTCACTTT AAATGTCTAT AGGGCTATTG 
135401 AGATTTCCAA AGGTTTTTTG ATTTTGTTTG ATATGTTACT TTTTTTCCCT 
135451 GGACTAGACA CATTTGTTTT AATTTTTAAA AGATACTTGG AAGGAAGTTG 
135501 ATTTTGATAT CTTTTTATTA TCCATTTCAT GTCTCTAGCC TCTGTCGTGA 
135551 TATTCCCTTG TTGGTTCTTT GTATTGCTAA TTGTGTTTTC TCTCTTAATT 
135601 TTTTCAATAA CTCATACTAG AAGTTTACAC AGTTTATTAG TCTTTCCAGA 
135651 AAACTGTTAA TCATCGCTTT GCAAGTTAGT TTTCTTCAGT CTTTTCTAGT 
135701 CATTTTAAAA TTATTTTCAT TTATAACTTT ATTGCATTTA TAACTTTTAT 
135751 TTGCTTTGTG TGTTGTACTT TATCTAATTT GAGATGGTTT CACAGATTAG 
135801 GTTCTTCAAT GCAGACACTG AGGTGAAGTT GGTGGTGCAA GATGAGAGGA 
135851 AGCTGGCTTA CGGCACAGGG AGAAGCTGAA CTTCGGTGCA ACTCCCACAA 
135901 ACCTTAGCCA ACTTGGTGGG GAGCTTTGGT GCCAATATTT CCTGTTGAAG 
135951 TAGTCCTGTC TTGGACTTCT CAGCATTGTA CTAGGCCCTT AAGCACCCTC 
13 6001 TTCAATCAGT TATTAGATAC AAGACAGCTC AGGAAGACCG TGGCATTAGG 
136051 ACAGGCAGCT CTCTGCAGCT GAGGCAGCCC CGGTAGGAGC TGACAGCTGA 
136101 AGATTGTCTG GAAACTGTAC CACCTACCAC CAGGCAGTGA GTCCTTCTCC 
13 6151 GAAAAAGGAT CTGGGTCAGA TGGATGCTCA TTAACTTTCA CGCACCTTTG 
136201 TTTTCTCACA TACACATCAA GTCTGTGAAT GTCCTTCTAA TCTCTGCTTT 
136251 ATGTGTATTA CACAGTTCTG CGATGTGTTT TTGTTAGTAT TCAGTACAAA 
13 6301 CACATTTAAA AACATTTGTC ATTATGATTT CTTTTTTGAC ACATAGATAT 
136351 AGTTAGATGT ATTTTTCTTA ACTTCTAAAA TGTATAGTTT TTGCCAGCTA 
136401 TATTTTTGTC ATCAGTATTA GTTTAACTAC AACATGCTTA TAAATATAAT 
136451 CGGCATGATT TCATCATTTT AAACTTTATT GAGACTTGCT TTATGTCCCA 
13 6501 ATAGTCAATT TTGTAAATGA TATTCTTCAA AAAAATGTGT ATTCTGTAGT 
136551 TGTTAGGTGT TCCTATTCGA GAGGCTGTAA CCAGGGTCTC CAACCACCAG 
136601 AAAGGAATGC TATCTGCTAG TGGCTACTGT GGATTATTTT TTCATTATCC 
136651 TCCTTTAAAA TGCAAATATC CTTTGGAGTG TTGAGTAATG AACGTCTTAA 
13 6701 AATTGGGTAT GAGAATGAAC TGAATTACCG AGGGAGTGGG ATTTGTGGGA 
136751 rAAGGACACAC ACACTCACAC ACACTCACAC ACACACCACT GTTTATTGCA 
136801 GGCTTAGTAA CATGCAAGTT TATTGTGTTC ACATACATTA TTTCACTGAA 
13 6851 TCCCCACAAT AATTCTGAGA GAAGGGTATT CTACCCATTT TATACATAGG 
13 6901 AAGCTGAGGC TCAAAGAGTT TAAAGAGCTT GCTAATATTT GAACTCAGAG 
136951 GTATCTGATA CTGGATTCCA TGCTCTTAAT TGCTATGTGC CTGTCTCCCC 
137001 ATAACTAGCC TCAGTTCCTT TAATACACTA ATTTGGTCAT CTGATTCACA 
137051 ATGCAGTTAT CATTACATTT GTAAAGGTTT TCTCTGATTC TAATGGATAT 
137101 TGAAAAGATA AGTGATGGCA GAATCACTTA TTCTTTAATT TTTAAGCATT 
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137151 ATCAGTGAAT CATTATGTAG ACGGCTTAGG CTATGAGCTC AAGATTATGT 
137201 TATGAGCTCA AGAAATGCAA AGGTAAAGAA AACACAGTCC AGTCCAGTAA 
137251 GATTGGGAGA CAGACATGTG TACACACAAT GAGAAGTCAT GAAGGGAAGG 
137301 TGGGCTGGCT GTGGAATTAG TCACCTTGAA GTCTGTCTAA GGAGAGGGCT 
137351 GGTATCAGGG TGTGGACATT TTACATTTCT ATCACAGCAT TGTAGTTGAT 
137401 CTTAGAGACT CCTACCGAAG TTTGGGGAGA AGAGAGGTAA TACAGGGATT 
137451 TATTTTAAAT TTCTTCTTAT AATTAGGTCA ATTCTAGAAT ATGAGGAATG 
137501 AATGACAGTT CAAACACAGT GTCTGAAAAA GGTAACCACA TTTTGTAAAG 
137551 ATTCCTTAAA TCAATAGGTT GAAATTCCAG TAATAGAGCT CCAAAATTTC 
137601 AGGGGAGCCT ATCCTATGGG AAAGCAGAGC CCTGTTGCTG GAAATCCACT 
137 651 AAGCCATTTA CAAGGAGTAA AGAAGGGCTT TCTGAAATGT TGACTTAGGG 
137701 ACACAGAGCT TAAACTCAGA GCACTTATTT TCTTCTCTGC TCTCCGGCTT 
137751 TCTGTCTCTG GCTGCATCAT TATTCTTTAT TGACTATTTC ATAAACTCAC 
137801 ATTCCCTCAG CTTGGTTAAG GAAGGCAGTC TTTAATCTTG GCGGAGCAGA 
137851 TAAGGACAAT TAAATCCTGG GCCTGCTTCC TGACACAGAT GACGGGGCTG 
137901 CCTTCCTGGA ACACCATCTT CATGTCATGG GTTCTGCTGT CACTTATTTT 
137951 GTTCGGAATG TGTCAGGACA GCTAGCATGA CCTCAGCATT CCCCTCAGGC 
138001 TGGTGCTTCT TTAATGCTGA ATAAAATGTT CTCCTGGACT CTTTCAGGGT 
138051 GGATTTGAGA CCCAGCCCTG TGGGCTGAAG GAGGATGGGT GCAGGTCTGT 
138101 GAATCTCTTC TTAATCTTTC AGCTAGAAAT AACTACTCCT TTCTGTGGTC 
138151 CTTTTGCAAT TTACTCATGG CTTCTTTTCT TTTCTTTTAC TTCTTTTATT 
138201 TTCTTTTATG ATTGTTCTTA CAATAGCCTG CTTTGATCTC CACTACGTCT 
138251 GGTTATTGGA GAATACATCA ACTGTTTTGT TGACTCAGCA ATACTTAATC 
138301 AATTCTAATT TCAGCACTAT TTCTTTTGAA AATTCCTCTC ACCTACTTTT 
138351 GCCAGTATAG TAAGGCTATC AGTTTCAGCA GTAGGTTTCT TTGATCTCCA 
138401 GAACAGGTTT AGGATCCAGG TTTGGCCCAT CATAGTACAC CATTATTTTG 
138451 GCTTCAGTGA TTGGCTCCTC AGTGAAATCA TGATCCTAAA GTTGACCAAT 
138501 CAGAGGTTTT TCCTGGGGAT TTTTATATAT GGATACTGGG GACAGATGGT 
138551 ATTTTTTTCC TCTTGTAGTG CAAGCTGAAA GAATGTGGTT ACAAATCTAA 
13 8601 TTATGGTGAT GCCCTTTAAC TTATGGATGG GCATAAGGGA GTGAGGCCAA 
13 8651 AATGCAACTT TAAGCCAAGC GAGAGATAGA GAGATGGAGA CTGCTAGTCG 
138701 GGATGCTGTT TGAATCCCTG AATGCAGTCT TTCCTGAGAC TAAATTATAT 
138751 TTAGTCTTTT AACCGTAACT GCTAAAAGTA ATTTTTTACT TTTAAACTTA 
138801 GAATATAAGT TCAAAGAGTG AAAGCAGAGA TTTTTCTTTA CATTTTCACT 
138851 GCTATAATTC CAAATGCCTA GAACAGTGCC TGTCACATAG TAGGCAGTCA" 
138901 ATTAATATTT GCTTAATGAA TATTTGCAAC TGAATGAAAC AGGATAATCA 
138951 GAGTATGCAT GTCTTCCTAA CTGTTTTAAG GGATTCTTGG GAAAAAAACC 
139001 CAGGGAGCAA TTATCATAGA GTTGTTGGAA TTAAAGCTAG TTTGTGGGAC 
139051 TGTCCTGAGA ATTTATTGTT ATTGAAATTT TTTTAAATTT TGAAGTGTAA 
139101 CACACAGAAA AGTGCATAAA GCACAAATAA GCAGTGTAAT GGGATTATTT 
139151 ATAAAGTGGA CCCATGACCA CTAGGTAGTT CAAGAAATAG AAGGCTTCCA 
139201 GCCTTCCAGA AGCTTCTCGT ATTTCCCATC TGGAATATTT CCCTCTGTAC 
139251 CAGCCAGAGG TAAACAACAT GCTGAATTTT ATGGTAATCA CTTCATTGCT 
139301 TTGTTTTATA ATTGTAGCCC CACTATATGC ATCACAGGAC AATGCTATTT 
139351 AGTTTTGAAT TTGTGAACTT AATAAAAATA AATTTCACAA TACATATATT 
139401 ATTTTGTGTC ATGTTTCTTT AACACTACAT TTTTATTATT CGTCTAAGTT 
139451 GGACATCGCA TTACTTTATT TTCATTGCTC TGTTTTATAA ATAAAACATA 
139501 TTCTGCTGCT ACAACAATTT TTCCCAGTTT TGGCTGTAAA AATCTATGCT 
139551 GCCATGAAAG TTCTTAGACA TGTATCCTAG TACCTATGTG TCCCCATTTT 
139601 CTATGGCATA TTTACCTAGA AGCGAAATTG CTGAGTCATT GGGTATGCCT 
139651 ATTTTGAACT TTAGCTAATA ATGTTTTATA TTGTCCTTAA TACGAGGCTT 
139701 TCTCTGATTT TTATTATAGC CAATTCAATG GTTGTATAGT TTTATCTTGC 
139751 TATAATTTTT ATTTGCATTT TTCTGATTAC CAAAGAAGCT GATCTTATTT 
139801 TGTTTATTGG CCATTTGAAT TTTTCTCTTG TATACAATAG CTAGTTATGT 
139851 TTTTGGCCCA CTGTTTATTG ATTTGTCTTT TTCTTTTCAA TTTTAGTTCT 
139901 GTGTGTGTTC TGGATACAAG TCCTTTGAAT GGTAATATGG CTAAAGATAG 
139951 CTTCTCACAT TCTGTGGCTT GTTTTCTTCT CTTTCTTTAT AATATGGTTT 
140001 AATAAATAGA AGTTCTTAAT TCTATTGTTG TCAAGTACAA CATAATTTTT 
140051 TTCTATGAAT AATAACTTCT GTGACTTTTA TGACTTAATT ACCAAAGTTT 
140101 CACACCATCA CTTTGCCATG TTCAATTTAT TAGACAGAGT CACTAAATAC 
140151 AGCCCGCAAT TAAGGGAAGA AGAATT^AGC TCCACATTTT CACGGGAGGA 
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140201 GTCCCCCAAA AATTTGCAGA CATCTTTCAA AATCATCACA CATGATAAAA 
140251 TCAGGTGAAG GTTTTTGAAG AGGCTGGGAT ATATTTAGAG CAAAGAAGGC 
140301 TTAAATGAAA TAAAGAAATC ATAGATGTTG GAGAGAGGTG AGATTAGAAT 
140351 ACGTGTGTTT TGTCTTTCAT AGGTATTTTC TTTTTGTGCC CCAGATCTTT 
140401 GCAGAAGCTG TGCTTAGTAG TGATGGACGT TTAGTTGGTG TGTGCTAAGG 
140451 ACTTATTCAC TGGCTGATTT GTTGTAATGG ATAAATTCTC AACTCGTGAC 
140501 CAGATTTTTA TGATCGCTTT CCCAGTTAAT CTGATTAGTT ATCTGATTAG 
140551 TTCAAAATAC TGGTTGCCTT GCCCTAAATA CTGAATTTAG GGTCAGATCT 
140601 TTTTTTTTTA TTATTATTAT ACTCTAAGTT CTAGGfcTACA TGTGCACTAC 
i40651 GTGCAGGTTA GTTACATATG TATACATGTG CCATGTTGGT GTGCTGCACC 
140701 CATTAACTCA TCATTTGACA TTAGGTATAT CTCCTAATGC TATCCCTCCC 
140751 CCCTCCCCCC ACCACACAAC AGGCCCCAGT GTGTGATGTT CCCTTTCCTG 
140801 TGTCCATGTG TTCTCATTCT TCAATTCCCA CCTATGAGTG GGAACATGCG 
140851 GTGTTTGGTT TTTTGTCCTT GCGATAGTTT GCTGAGAATG ATGGTTTCCA 
140901 GCTTCATCCA TATCTCTACA AAGGACATGA ACACATCATT TTTTATGGCT 
140951 GCATAGTATG ACATGGGGTA TATGTGCCAC ATTTTCTTCA TCCAGTCTAT 
141001 CATTGTTGGA CATTTGGGTT GGTTCCAAGT CTTTGCTATT GGGAATAGTG 
141051 CCACAATAAA CATACGTGTG TATGTGTCTT TATAGCGGCA TGTTTTATAA 
141101 TCCTTTGAGT ATATACCCAG TAATGGGATG GCTGGTTCAA ATGGTATTTC 
141151 TAGTTCTAGA TCCCTGAGGA ATCACCACAC TGACTTCCAC GATGGTTGAA 
141201 CTACTTTACA GTCCCACCAA CAGTGTAAAA GTGTTCCTAT TTCTCCACAT 
141251 CCTCTCCAGC ACCTGTTGTT TCCTGACTTT TTAATGATCG CCATTCTAAC 
141301 TGGTGTGAGA TGGTATCTCA TTGTGGTTTT GATTTGCATT TCTCTGATGG 
141351 CCAGTGATGA TGAGCATTTT TTCATGTGTC TTTTGGCTGC ATAAATGTCT 
141401 TCTTTTGAGA AGTGTCTGTT CATATCCCTC GCCCACTTTT TGATGGGGTT 
141451 GTTTATTTTT TTCTTGTAAA TTTGTTTGAG TTCATTGTAG ATTCTGGATA 
141501 TTAGCCCTTT GTCAGGTGAG TAGATTGCAA AAATTTTCTC CCATTCTGTA 
141551 GCTTGCCTGT TCACTCTGAT GGTAGTTTCT TTTGCTGTGC AGAAGCTCCT 
141601 TAGTTTAATG AGATCCCATT TGTCAATTTT GGCTTTTGTT GCCATTGCTT 
141651 TTGGTGTTTT AGACATGAAG TCCTTGCCCA TGCCTATGTC CTGAATGGTA 
141701 TTGCCTAGAT TTTCTTCAAG GGTTTTGATG GTTTTAGGTC TAACATGTAA 
141751 GTCTTTAATC CATCTTGAAT TAATTTTTGT ATAAAGTGTA AGGAAGGGAT 
141801 GCAGTTTCAG CTTTCTACAT ATGGCTAGCC AGTTTTCCCA GCACCATTTA 
141851 TTAAATAGGG AATCCTTTCC CTTTCCCCAT TGCTTGTTTT TGTCAGGTTT 
141901 GTCAAAGATC AGATAGTTGT AGATATGTGG CATGATTTCT GAGGGCTCTG 
141951 TTCTGTTCCA TTGGTCTATA TCTCTGTTTT GGTACCAGTA CCATGCTGTT 
142001 TTGGTTATTG TAGCCTTGTA GTATAGTTTG AAGTCAGGTA GCATGATGCC 
142051 TTCAGCTTTG TTCCTTTGGC TTAGGATTGA CTTGGCAATG TGGGCTCTTT 
142101 TTTTGTTCCA TATGAACTTT AAAGTAGTTT TTTCCAATTC TGTGAAGAAA 
142151 GTCATTGGTA GCTTGATGGG GATGGCATTG AATCTATAAA TTACCTTGGG 
142201 TAGTATGGCC ATTTTCACAA TATTGATTCT TCCTACCCAT GAGCATGGAA 
142251 TGTTCTTCCA TTTGTTTGTA TCCTCTTTTA TTTCATTGAG CAGTGGTTTG 
142301 TAGTTCTCCT TGAAGAGGTC CTTCACATCC CTTGTAAGTT GGATTCTTAG 
142351 GTATTTTATT CTGTTTGAAG CAATTATGAA TGGGAGTTCA CTCATGATTT 
142401 GGCTGTTTGT TATTGGTGTA TAAGAATGCT TGTGATTTTT GTCCATTGAT 
142451 TTTGTATCCT GAGACTTTGC TGAAGTTGCC TATCAGCTTA AGGAGATTTT 
142501 GGGCTGAGAT GATGGGGTTT TCTAGATATA CAATCCTGTC ATCTGCAAAC 
142551 AGGGACAAAT TGATTTCCTG TTTTCCTAAT TGAATACCCT TTATTTCCTT 
142601 CTCCTGCCTA ATTGCCCTGG CCAGAACTTC CAACACTATG TTGAATAGGA 
142651 GTGGTGAGAG AGGGCATCCC TGTCTTGTGC CAGTTTTCAA AGGGAATGCT 
142701 TCCAGTTTTT GCCCATTCAG TATGATATTG GCTGTGGGTT TGTCATAGAT 
142751 AGCTCTTATT ATTTTGAGAT ACATCCCATC AATACCTAAT TTATTGAGAG 
142801 TTTTTAGCAT GAAGTGTTGT TGAATTTTGT AAAAGGCCTT TTCTGCATCT 
142851 ATTGAGATAA TCATGTGGTT TTTGTTGTTG TTTCTGTTTA TATGCTGGAT 
142901 TACGTTTATT GATTTGCATA CATTGAACCA GCCTTGCATC CCAGGGATGA 
142951 AGCCCACTTG ATCATGGTGG ATAAGGTTTT TGCTGTGCTG CTGGATTCAG 
143001 TTTGCCGGCA TTTTATTGAG GATATTTGCA TCGATGTTCA TCAGGGATAT 
143051 TGGTCTAAAA TTCTCTTTTT TTGTTGTGTC TTTATAGACC TACAAAGAGA 
143101 CTTAGACTCT CACACAATAA TAATGGGAAA CTTTAACACC CCACTGTCAA 
143151 CATTAGACAG ATCAACGAGA CAGAAAGTTA ACAAGGATAT CCAGGGACTG 
143201 AACTCAGCTC TGCACCAAGC GGACCTAATA GACATCTACA GAACTCTCCA 
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143251 CCCCAAAGCA ACATAAAATA CGTTCTTTTC AGCACCACAC CACACCGATT " 
143301 CCAAAACTGA CCACATAGTT GGAAGTAAAG CACTCCTCAG CAAATGTAAA 
143351 AGAACAGAAA TTATAACAAA CTGTCTCTCA GACCACAGTG CAATCAAACT 
143401 AGAGCTCCGG ATTAAGAAAC ACACTCAAAA CCACTCAACT ACATGGAAAC 
143451 TGAACAACCT GCTCCTGAAT GACTACTGGG TACATAACGA AATGAAGGCA 
143501 GAAATAAAGA TGTTCTTTGA AACCAATGAG AATAAAGACA CAACATGCCA 
143551 GAATCTCTGG GACACATTGA AAGCAGTATG TAGAGGGAAA TTTATAACAC 
143601 TAAATGCCCA CAAGAGAAAG CAGGAAAGAT CTAAAATTGA CACCCTGACA 
143651 TCACAATTAA AAGAACTAGA GAAGCAAGAG CAAACACATT CAAAAGCTAG 
143701 CAGAAGGCAA GAAATAACTA AGATCAGAGC AGAACTGAAG GAAATAGAGA 
143751 CACAAAAAAC CCTGCAAAAA ATCAATGAAT CCAGGAGCTG TTTTTTTGAA 
143801 AAGATCAACA AAATTGATAG AACACTAGCA AGACTAAGAA GAAAAGAGAG 
143851 AAGAATCAAA TAGATGCAAT AAAAATGATA AAGGGGATAT CACCACCGAT 
143901 CCCACAGAAA TACAAACTAC CATCACAGAA TACTATAAAC ACCTCTATGC 
143951 AAATAAACTA GAAAATCTAG AAGAAATGGA TAAGTTCCTC AAGACATACA 
144001 CCCTCCCAAG ACTAAACCAG GAAGAAGTTG AATCTCTGAA TAGACCAATA 
144051 ACAGGCTCTG AAATTGAGGC CTGTTATTGG AATTATCTGA AATTAATAGT 
144101 TTTCCAACCA AAAAAAGTAC AGGACCAGAT GGATTCACAG CCGAATTCTA 
144151 CCAGAGGTAC AAGGAGGAGC TGGTACCATT CCTTCTGAAA CTATTCCAAT 
144201 CAATAGAAAA AGAGGGAATC CTCCCTATCT CATTTTATGA GGCCAGCATC 
144251 ATCCTGATGC CAAAGCCTGG CAGAGACAAA ACAAAACGAT CTTAACCAAT 
144301 TAATTTCAAA TTACTTTTGA CTGTAGTCAG AATGATTAGA AGTTCATGTT 
i44351 TCTCCTGCCT TTAGACATCT TAATACATCC TCCAACTTTT TCTTACTTTT 
144401 TTTTTTTTTT TTTTTTTTTG AGACAGAGTC TTGCTCTGTT GCTCAGGCTG 
144451 GAGTGCAATG GCCTGATCTT GGCTCAATGC AACCTCCATC TCCTGGGTTC 
144501 AAGCGATTCT CCTGCCTCAG CATCCCAAGT AGCTGGGATT ACAGGCATGC 
144551 ACCACCACAT CTGGCTAATT TTTGTATTTT CAGTAGAGAC AGGGTTTCAC 
144601 CATGTTGGTC AGGCTGGTCT CAAGCTCCTG ACCTCGTGAT CTGCCTGCTT 
144651 CAGCCTCCCA AAGTGCTGGG ATTACAGGTG TGAGCCACCA CGCCTGGCCT 
144701 ACATCCTCCA ACTTTCTCTT CTATTTCTTC CTTTTGAAAA CTAGCCATCT 
144751 TATTTTCCTA TTCCCATCCT ATCTGTCATA TTTAGAGAGA TCTGTCAAAT 
144801 TTATCCTTGA TGCAGACTTA TGCTTAATAG CTTAACAAAA CATTACTACA 
144851 TCCATTTATT TTTTGCATTT TAGTAACATA TTTTGGAGAT AGAAAACTTT 
144901 AATATAAAAT AATTACTTTC TTATTAGTAC AGGAACGTAG TGTTCATTCA 
144951 TTCATATACG TGCTAATACC CCAATAGTTG GCACAAAGGC CATGAAAAGA 
145001 CATGAGCTAT TCTATCCACA AACAAGATTT CCCCATCTAA CTATATCTAG 
145051 CTCTGCCTGC TATACTTGTT TGCCCTTGAA TTTATCCAGA CCCGAAGCAG 
145101 CAGCATATTG GCTTTGCACT TGTGTTTTTG TTCCTTCTGT GACTTTGCTT 
145151 CTCTTCCCAA TTAGCCACTG TTCTAATCAA TTTAATCCAT CTCTGACAGC 
145201 ACGCCATATT TTCCTATCTT GTTGATGAGC ATGTCATGTG GAACAGAGTC 
145251 AGAAGCCTCC CTGATGTCAA AATATACTAT GTCTGTCCCT TCCTTCTTGC 
145301 ATACAACGTC TGACACTCTG TCAGAGGAGG CAATTGAATT GGTCTAACAT 
145351 GATTTGTTGG TCACAAAGTC AAGCTGATTA TTACCTATCA TTCTGAGCCC 
145401 TTCCATGTGC TTGCAAATTG. ATTAGGCATC TTGATCTTGT GGTTAATGCA 
145451 CTTTTTGTCA TTGTGTAGAG TCTGGGACAC TTCAATGCAA TCTTGGGCTA 
145501 CTGTGAGAAT ACATTTACCA TCTATTCCAC TTTCATTAAA GATAAAGAGC 
145551 TCATTTGCTT TTATAGCTCA GAAATTCTAA GAAAATTCTT AATGCTCTCA 
145601 AGGACTCTTG GGAATCTTAA TAGTAGCCTA GATTTGGCAT CACAAGAGTT 
145651 AAAACGAAAT CAAGTTTCTT ATCCTTATTA CCAGGAAAGA CAACACACGT 
145701 TTGTTGAGTC CCTGCTATGT AAAAGGCATT TCTGCTGCTT GGTTATGCGA 
145751 TAGATACAAA GTTGTATAAG ACGGTCTGAA ATCTTGTGTT GTCTGTAGAG 
145801 AACACGGACA CATGAGAAAC TAATTGACAC CAAAATATTT TATGTGAACA 
145851 ATGGGGAAAA AAAGACCATA CAGCTTTTTG GTTGGTTGTG TGTTATTAGG 
145901 AATCACAGTT TCTTTGTGCC TGCTTGTCTA GCTAAATCAT TTAAAATAAT 
145951 AATGCAGACA TATTATTTGC ATATTTCTTT TTTAATTTAA AATTTTAAAT 
146001 GATCCATAAT AGATGTACAT ATTTTTGGTG TTCATGTGAT AACAGGTACA 
146051 ACCATATAAT GTGTAAAGAT CAAATCAGTG TAACAGATAC CCATCACCTT 
146101 ATATTTATCT TTTCTTTTTG ATTTGACTGC TTCCTAAATT TCGACTCATC 
146151 TTTTATTTAT CTACATCCTC AAAATGCCTA GTGGCAGTGA GGCTGATTTA 
146201 TTTATATTCT TGGTGTCATT ACATGTTTCC TATTGTTTTC TTTGTCTTTT 
146251 TCTTTTTATC AGCTTTGTAC TTATGCCTAG GTAAGGGCTT GTAGGTATGT 
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146301 TCTTTCAAGG ATGGGAACAT TTTCCCAAGA TTCTTGATCG TAGGGTCTCT 
146351 TGAGAACTCT CACTGTTACT GTTCTAGGGT CTCTTGAGAA CTCTCACTGT 
146401 TACTGTTCAA TAAAGGCTTG GTAATAACAT ATTATCACCA TCAACAAAAT 
146451 ATAAAAAATA TAACTCCTTG TATTTATAAG CATATTGATA AGAATCAAAT 
146501 TATTAATGAA AGAATAACAA AAATGCTTGG CAAAAAATAT CCAACGACCA 
146551 CCTTTTTTAT TTTGCTTTTT TTCCAAACAG ATTTATTTTA AAATACAATT 
146601 CTGAATTAAA AAATTAAAAT ATTCCAGGAC AAAACATTTG TCACTATACT 
146651 CAGCTACAAT CCTTTGAGGA AATACCTTTT CTATTGCTTA TTGATAGGCT 
146701 TCTGTCATTA CAAGTTTGAG GTCTCAAGAG CTGGGAAATA TTAGGATATT 
146751 GGATGATACC GGATGATATT AGATAATTAT TGATAATTCT ATTAGGTGTG 
146801 CTGAGGTACT TAGGAAATTA TTAGTTCTCA GAGAAGGATA CTGTTCTACT 
146851 TAAAGTGTAG TGCTATAGGT AAATTCCTTG GAAATGTTTG GCAATACATA 
146901 ATACATGTAC ATAAATACAA TACATATATG TGCATATACA TGTAAAATAT 
146951 CTATTTGAAG TCTCAGGTCT TGAGCCCAGA AGAAGCTTCT TGGAAGCTCT 
147001 TTTCCCCCTG CTTCTTCTGA TCACTGCATT TTTTCATGGC TTTATGACCC 
147051 CACAACCATC ACCTTTACTT TTTCTCTTCT ATGCCACTCT TACTAGTTCT 
' 147101 AATGTATTTT CCCAGTGTAT GAAAAGGGAA TCTTATATTA TATCATCCAG 
147151 TCAATATTTT GAGCCTCACA GTGTTCTGCA GTCTATTCTT TTTAGCCAAG 
147201 TGAAAATGAA AAGGAGGCAA CAGAGAGGGA TGCCCATGTC TGCATCAAAA 
147251 CCTGCAATGC GCACACACTC CTCATCTGCT TGTCTCCTCC CTTGAAACAC 
147301 ACGTAGGGTG TGACAATGAA GATCTGATAA GGAAAGTAGT AAAAAAAGAT 
147351 CAATAAATAA ATTCTTCAGA CAAGTGGCCC TCAAGACAAG TTTGCTGAAT 
147401 CTGTAACTTA GACATTTAAA AATCTGCCAA AGTCTCTCCT TGCTAAAGAA 
i47451 ATATCATAAA AGGCTCTCAT AAAAAGGCTG ATGTTTTCTC CAGACATAAA 
147501 AGGACAGAAA CATTTCAAAA GAGTTATGAT ACAACTTGGC TTTTTTCTTG 
147551 GCATAAACAA CATTTCCTTG AGGTTAAAGG GTTGAAGTAG ATCTAGTAAC 
147 601 TTTAGAATTT TATTTCAATA AAGCAAAATC AATCAATAAG ATGTATGCTG 
147651 TTGAAATCCA AGCAAACTCT GGCATAGTGT AATTGCACAA AAGTGTCTGA 
147701 GAAAAAAGAA TGTAGATCAA TAATTGATTT TCTCTCATTT AAATTTTTTA 
147751 TTCTCTGTCG GTCAGGATAC AGACTAAAGG GAGTGAAGGC CTTTTTAAGC 
147 801 TCCCTGCTTA TTTGAAGTGT GCAATAGATG GATAAAAGCA AGCTTCTTCC 
147851 AAACACAGTT CTCTTGCTTT CTCATCATTT TGAATCTGTG GATGCAAAGA 
147901 ACAATTTTTG AAGTACATTA AAAACAACGA CTACAGAAAA GATCAAGCTC 
147951 ACATTAATAA AAGGAAGTAT GGGCTAGTGC ATACATTTGC TGCATAAAGC 
148001 ACATGGAATA TACTTTTTCT GATTCTCATT CATTCAAAAT TTTGAAGCAG 
148051 CCTAGAGCTG TAAGAACTTT GGAGGTCTTT TATTTTGGCT TTCCATTTTT 
148101 AGAACTCCTA AGTGGTGGTG ACCATCTTAT CTCCAGGTGC TGGGAAGACT 
148151 CCCACTGCAT GAAAGCTGGG CCTTTCTATT ATTGGGCAGG TTAAATTCTA 
148201 AGAAATCTTT CAGTTTTATT GGATACAATC TATTTTTCAC TGATGGATTC 
148251 TGACTTTGTC TTTTGCAGCT ATGAGTAATA GCTAACAAGG GAGGCAATAG 
148301 ATTGTAGAAA AGTGCCTCTC AAGTGTGGTT CCTTCACTAC CAGCATCAGC 
148351 CTCATCTGGG AAGTAATAAC AAATACAATT ACAAGGTTCT ACTCCAAAGA 
148401 GATCGAATCA AAAACTCTGG GGGTGGGACC CAACCGTCCA TGTCTGAACA 
148451 GCCCTCCTGG TGATTCTAAA ACATGCTAAT GTTTAAGAAT CCCTGCTTTA 
148501 TCTATTCGTC TTATAAATAA TGAGTCTCAT TCTAGGACAG TTTCTAGAAC 
148551 TGCCTATTAC ATTCCTCTGA ACAAGGTATG GGTTGAATAA GCAAACCCAA 
148601 GTGTACCCCT TTTTTGAAAA CAGTCTAAGT CAAACTATCC AATCCAATTC 
148651 AAGAAGCAGC AAGCCCCTAC CTTGATTGAT CAGATCAAAG AACACGAATA 
148701 AAAAAATAGG TTTGTCATAT CTGCAAGGTA AGTAAAGATG CATTTTTTGA 
148751 TTATTAGCAC CATGTAACAG AAGCTGCTAG ATGGAACAAT TCCAAACAAC 
148801 AAAAACAAAA AGTGGTGGTG AAATTTTTCT TTTTCTTTCT TTTAAAAATA 
148851 AATTTGGATT TATCATTTCT TTTATGATGC TTTAAATGTC AGGGAAAGCC 
148901 TATCCACAGT GTGCACAAGT CTAATTAGGA TTATAATAAG TAACTGCTGG 
148951 AATGATTTTG AAATTGGCGC TAATCTACTC TTGTTACTCC GTGCTTGAGG 
149001 AAACTGAGAC TTAACGTAGA TCAATAATCC ATCCTCATAA AATTTAGTTA 
149051 CATGCAGACA GTTAGTAGCA GATTCTGCAC TCGGAAACAG TCTCTGGATT 
149101 CCCGATCCTC CAGGCTTTCT CATCCATGTC CGGCTGCATT TCTTTTCTTA 
149151 TTCTGGATTT ATTCTTGTTA CCATGGAAGC TACAGATATA GAATAGTATA 
149201 ATATAGGACT TTCAGATTTA GAGCAGTTCA GGGCTGTGAA TGTAGCTTAG 
149251 CACATCATGG TGCAAACATA AAATTAAAAA CAAATAAAAA CAGCCAGGCA 
149301 CGGTGGCTCA CGCCTGTCAT CCCAGCACTT TTGGGGGCTG AGGCAGGTGG 
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149351 ATCATTTGAG GTCAGCAGTT TGAGACCAGC CTGGCCAACA TAGTGAAACC 
149401 CCATCTCTAC TAAAAATACA AAAATTAGTC AGGCGTGGTG GCGCGCACCT 
149451 GTAATCCCGG CTACTCGGGA GGCTGAAGCA GGAGAATCTC TTGAATCCAG 
149501 GAGGCGGAGG TTGCAATGAG CTGAGATCGT GCCACTGCAC TTCAGCCTAG 
149551 GCGACAAGAG CAAAACTCAG TCTCAATAAA TAAAATTTAA AAAACATGAA 
149601 TACTTTAAGG ACACAAAAAT GCATCGTTCC TCAGTTCATT TGTTTACTTA 
149651 TTTATTTACT CATTCGGTCA TTCACTTCCT GTGTTTAATA ATTTATTTCA 
149701 TAGGCGTTTT CTTGGGACCC ATTCATTATG TGCTCTTTTT TTTTTTTTTT 
149751 TCAGACGGAG TCTCGCTCTG TCTCCCAGGC TGGAGTGCAG TGGCCTGATC 
149801 TTGGCTCACT GCAAGCTCCA TCTCCTGGGT TCACGCCATT CTCCTGGCTC 
149851 AGCCTCCCGA GTAGCTGGGA CCACAGGCAC CTGCCACCAC GCCCGGCTAA 
149901 TTTTTTGTAT TTTTAGTAGA GACGGGTTTT CACCGTGTTA GCCGGGATGG 
149951 TCTCGATCTC CTGACCTTGT GATCCACCCG CCTCGGCCTC CCAAAGTGCT 
150001 GGGATTACAG GCGTGAGCCA CCGCGCCCGG CCATATGTGC TCAATTTTTA 
150051 CCTGGTCACT GTGTGAATAT TGGGTATTAT TTACAGTTGA ATGCCACTTC 
150101 TATGAAGGTA AGGGCTGGGT TTTATACAAT TTATAAGCTT AATAGTGATT 
150151 AGTTTACTGT CTTAAAAATG TCAATATTTA TTTGGTCAAT CAGTGCAGAA 
150201 AGCACAATCA AGAAAAAGAA TTTATAATAA ATGTCCTTTC CCCACCAGAA 
150251 AGTGGAAAAG TCAGGAAGAG CTACCATGAT CCAGAGAAAA GGGAGCAGAT 
150301 GTCAGCAGAG GCATCAGGAC TGGGAAATCA TGTGATGAGG GGGCATTTGA 
150351 AGGAATTTCG GAATCAGGGA AGTGATTTTA GGCTGCAGAG CACGAGTAAA 
150401 GGCTTCTTGT TCAGAAGAAC CAGTGAGGCA TTTGGGTGAT ATTGAAGAAG 
150451 AAGGTGTTAT TCTTCTAACA TTATAATTTG GGCTCCTTAA TTTTCTTTAA 
150501 GTAATAAAAC CTAGAAAATT ACACAACTGA ATTTTTTTTT CAACTTAGAA 
150551 GTGGCATGAA AACTGGATTC ACGGGTGAGG AAAGTATTTG GGTAGGAAGA 
150601 GAGGATTATT ATGATATTTT CCAGGCTTCT CCCTGCCAGA ATACTTGGCT 
150651 TTCTGGTTTC CATTACTGTA CACTGCCCAG TGTGTTTATC TGTAAAACAG 
150701 TCATCCACAT TCATTTTCTG AGCAACTATA ATGAACAGGA AATTATGTGC 
150751 TTAGAATGAG AAACAAGATA ATTGTAAAAA CTGTTGACTC AATTATTTCC 
150801 CAGGCATAAC CCCTCCGGTA GTACCTACCC ATCTTTATGT CTAAGTGGTC 
150851 TGCCCTAATT AGATGTATGT TGGTAGATTA TGGAGAGCTA ATTAAAATGT 
150901 TGATGAGACC AGGCCAGTAA AGCAAGAGAA TGAAGCAGGC GTAGGTGGAT 
150951 ATTCTTGTAT TGAACACCTG GAGAAGGTTT GCCTTTGTAA ACAAGACACC 
151001 TGCTTCCAGA GACCAAGCTA GTCACTGCCA TAATACAAAT GTTAGTCAGT 
151051 TGCTATAAAT TCTGATATTT TCTATGAAAA GCCAAGAGTA CCATCAATTG 
151101 ATGAATGGAT GAAGMAAGA TGATATACAT ACCCAATGGA ATATTATCAA 
151151 GAAGGAAAAA GAAAAAAGGA AATCCTGTCA TTTGTGACAA CTTGGGTGAA 
151201 CCTGGAGGAC GTTATGCGAA ATAAAATAAG CCAGGCACAG AAAGAGAAAT 
151251 CTTACTTAAA TATGGAGTCT AAAAAGTCAA ACTCATACAA GTAGAGAGTA 
151301 GAATAGTGGT TACCAGAGGC .TAGGGATGGG TAGGAATGGG CAAATGGGAG 
151351 ATGTTGATGA AAAGATATAA AGTTTGAGTT AGACAGAAGG AATACATTTT 
151401 AGTCATGTTT TGCATGGCAT . GGTCACCATG GTTAGTAATA ATGTACTGCA 
151451 CATTTCAGAA ATAAAGGGGT AGATTTTAAA TACTCTCACC ACAAAAAATA 
151501 GTAAGTAGTG GGGTGATGAA TATGCTAATT AGCTTGAGTC AATCATTCCA 
151551 CAATGTATAC ATGAAGTAAA ACATCACATT GTGCCCATAA ATATATACAA 
151601 CTATCATTTG TCAATTAAAA ATAAAATAAA AAGAAAAGCT AGAGATATGA 
151651 ATTTCATGCG AAATCACCCA ATTTTTAAAT AGTTATACAG CACTTTCAGC 
151701 CACACAGAAC AAGACAGCAG CCCTCATTTG GACCTAGGCA TCAGTTTGGA 
151751 AAGTTGACAG GTAGATAAAT CTCAAGCACT TCCAGGAGGC AAGTCTGTAG 
151801 AGGAAGAGCT GGATATTTGG TCTGGAATTT TGTGATCTTT CCCACATATT 
151851 GTAACTAGCC CCTGACTTCT TAGGAGCTCT TGTGTCTGAA GGCAAACAAT 
151901 ACATCACTAA CTCGATGCTG TACCAAGTCT GATTTGCAAA ATTTATGTCA 
151951 AATAAGCCTC CTGTTAGGGA ATTTGATTTG GCATATAGCT CCTTCCAGAC 
152001 CTCAfiACTGT CTTTCCAGCC AGATAATGTA CTCATCATAC CCAAGATGTT 
152051 CACTAATGCT GCTCAAAAAG TGTGAGGATC CATTGAGGTT TGAATAAAAA 
152101 TGCTAAGCCT ACTGTTATGA ATTGGGAATA TTTACCAGCT CTGATCCAGA 
152151 TAGTAAAAGG AACCAGTTGT GCATTGTGAA AATGCTAGAA TGAAATGTAA 
152201 GCCTGGCATT TTGTGGCAGC AGCAGGATGA GACCTCATAC AGTGGGCCCT 
152251 TCTGCAGCTT CTGGAACCTG. TGTGGAAATC ACAGCAAGAG TATTTATTAT 
152301 GCATTGTTTC TTTATTATAT ATTTGTGTCT TTGGGTTAGG CTGATTATAA 
152351 AGACAGTGGT CTTATGGTTT ACAAAATGAT CATTTCCTGC AATGTCTTTA 
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152401 GAGTGGTGTT TCTCAAATGT TTTGAACCAT GACCAAACAT GAAATACATT 
152451 TTTCATTGCT ACCTGGTATA CACACACACA CACACACACA CACACTCAAG 
152501 CAAAATTCCA GGAAACAGTG TTTCTCTTAC TACATAAAAT CCACTGTGGT 
152551 ATTTTATTTT CTTTTTATTT CCCTTCTAGA CTAGCCTAGG CGAAACTAAT 
152601 TTATTCTATT CCACTTAATA AAATATGTAA AAGATGTTAC AATCCATTAG 
152651 TGCGTTGCAA CCCACAATTT CAAAATCACT GCTCTAAAAT ATGTTAGAGT 
152701 TTATAGAGAT CCTTGGGCAC TCAACCTGAC TGCCCAAGTC TAATGATGAA 
152751 TATATCCTTA AATTCCTGTA ACAGCAACAA GAAGAGAGAA AAATAAAAAA 
152801 CAAAGAGAAA TAAAGGAGAC AGAGAAAGAG AGAGGGAGGA AGGGAGGGAG 
152851 AGGGAGGGAG AGAGGGAGAG GGAAAGGGGG GGAGAGAAAG AGAAGGAAAG 
152901 AGGAGGAGAC AAAAGGAAAG AAAAGAATCA ATTTCAATTG ATGATTCCAA 
152951 TTCTTTAGTT CTTTCATTTA TTATCTGAAT ATCTATGATT AATGGAGGAC 
153001 ATATTAAGTG CAAGATGGGA GTAGTATTAG GTGTCTCCAG TGCAATGTTT 
153051 AACAAAGCAC TCAGGTTCTT AGAGATGCCC CAAAAGTAAA ACTTAGCTAG 
153101 TCCAACAAAC AAATAAATTG TAAATTGGCA AGAGTCAGGG CAAAATCGAC 
153151 AGAATTGTGT TCTGTAAAAG AAGATGGTAT AAGTAATTTC AGGGATACGA 
153201 ACTGTAGAGT ATTTGGGCCA GGCAGAGTGT TATACTAGGA TTTGGTATAG 
153251 AATAAGAATG CAGTGTCTGG TTCTTGAAAC CTGGAAAGTC TAAATCATAG 
153301 TGTGATTCAT TTCTCTTTTT GTGAGAGTTG TTTTAACTAG ATCAATTCTG 
153351 GATAAAGCTG ATGTTTTTGC TATTGTTGAT GATGCTAAAT ATGATAAATA 
153401 TGGTAGCATT TATAGTGCCA GACACTAGAC AAAGGAATTT GAATGCATTC 
153451 AATCATTTAG TAGCTTGTAT CTTCATTCTT CCTTCTCCAC TTGGCTGGAC 
153501 TATTCTTCCT TCACATGCAG ACAGCTCTCT TTCCACTTGG TCCTGAATGA 
153551 TCCTACAACT AGTTTCTTTG TAACTGACTT CGTCAAGTGC TCATTGAGGG 
153601 CTGGTTGATG GAGACAGAGT GAGTAGTGCC AGGATTATGC TTGCTGTATT 
153651 CATTTTCGTC GACCATCTAT ATATTTCCAA TCTTTGTACA GTGCACTGTA 
153701 GGAAC TAAAG AGCCAGGGGC ATACTTAGCT AAGTATTTCC AAAATGTTTC 
153751 AAATAAGTAC TGTGGATTAT CTATAAGCAT CAACACATAA TCCGGCATCA 
153801 TTTCTCACTC CTCATCTAAC TTGAGTTATA GTTTATAAAA CATTTTCTTC 
153851 GTTTCCAGGA TGGCTTAGGG TCCTGCTCAT CTTATTTTTC TACTTCTACC 
153901 TGCTTAGCAG CTTCTTGTTT CTTTTTTCTC CCCTCGTGTA CTTGTTCCTT 
153951 CTTATCTCTC TGACCTCTAA ATGTTAATGG GCCCCAGAGT TCGGTTTGTG 
154001 AAGGAGACCT TAGCTAGTCT TATTACTAAG TACCATCTTT ATGCTAATGA 
154051 ATCCCAAATG CATATCTTCA GCCCAGGATT TTCCCCTGAA TTAAAAATGT . 
154101 GTATAGACAA CTTTCTACTT GATATCTACA CTAATATGGC TAAAGGGTAT 
154151 TGTGTAGAAA AGAGTTAAAA CAGCAAACCC GAGACAATGA TCTTAGAAAG 
154201 GTTTGCTXGA ATGGTTGACC CTTGGTTGTC ACTGGAAGTT GGCTTTCGAA 
154251 GTGTTCCCAG TCCTCAGTTA ACTGATAAGA GTGACTCGCT GTGTCTAAAT 
i54301 TGTTTGTACT AAACAGTGTG GTTTATACTG AACACGTATT CCTTTTGGGA 
_ 154351 GTCTAGGATA TTTTGGTACG TGCCACACAG AGGGCCCCTA TAAAAACACC 
154401 CTCACAACAA AAATGGGGCA CAGAGACCCT AAAAAAACCC CCCCACAAAA 
_ 154451 CAAAACATTG CAAAAATGGC CCCAAAACGT GGCCACAGAA AAAAAAAAAA 
154501 AGTGTCCCCT GGGGGTCCAC AAGAGAAGAG GAGTGGTGTT GATGACGGAT 
154551 TTTTTTTTTT TTAATAGACG GAATCTCACT CTGTCACCCA GGCTAGAGTG 
154601 CAATGGTGCG ATCTCAACTC ATTGCAAGCT CCGCCTCCTG GGTTCAAGCA 
154651 ACTCTCCTGC TTCAGCCTCC CGAGTAGCTG GGATTACAGG CGTATGCCAC 
154701 AACGACTGGC TAATTTTTGT ATTTTTAGTA GAGACGGGGT TTCACCATGT 
154751 TGGTCAGGTC TGGTCTCAAA CTCTTGACCT CGTGATCTGC CCGCCTGAGC 
154801 CTCCCAAAGT GCTGGGATTA CAAGTGTGAG CCACCACGCC CAGCTGAAGA 
154851 GGAGTCTTAG AAACTTGCAC CTAGTTGCCT CTGGACTTCA TTCCACGTCC 
154901 CTTTTCCTTT TGTTTACGTT ACTTTGTATC CTTTTGCTGT AATAAATCCT 
154951 AGCCATGAGT ATAACTATAT TCTGCATCCT GTGAGTTCTT TCAATGAATC 
155001 TTTGAAGCTG GGGACAGTCT AGAGGGCCCC CGGCACAGCC ATCTTAACAA 
155051 TAAATGTCCA AAACTGAGCT CTTGATATTC TTATCACCAA TC^AGCAAAT 
155101 AAAAAAAAAA CCATCTAAAC GAATAAAACA AACCAGCAAA TAATAATGAT 
155151 TTTTCCTACA TTTTTTATAA TCTTCCGTAT TTCATTAATG GCAATTTTAT 
155201 TCCTTCATGT TACCCAGGCC AAAAATCTTA TCTTTGATAT TGTCTTAGTC 
155251 CCTTTTGTGC TGCTATAATA AAATACTTAA GACTGGGTAA" TTTATAAAAA 
1553 01 TCAGAGATTT ATTTCTTACA GTTCTGAAGG CTGAGAAGTC CAAGGTCGAG 
155351 GGGCCCATAA TTGGCAAGGG CCTTCTTACT GTGTCTTCCC ATGGCAGAAG 
155401 GCAGAAGAGC TAGAGAGCAT GTGTGAAGAG AAAGGAAAGG CAGAGGGGCT 
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155451 GAGCTTTTAT CCTTTTATCA TGAACTCACT CCTGCAATAA TGTCATTAAT 
155501 CCCTTCGTGG GGAAGGAGCC CTCATGACCT AAACACCGTT TTAAAGTCCC 
155551 ATCTCTCAAC ACTACTGTAC TGGAAGTTAA GTTTCCAACA TACAAATTTT 
155601 AGGGGACCTA TTCAAACCAT ACCAGTCTGT TTCTGGCCCA ATTTTTGTCC 
155651 TTCTCACAGG CAAAATATAT TCATTGCATC CCAATAGCTC CAAAAGTCTT 
155701 AACTTGTTCT AGAAAATAAA AATTAAGTCC AAAGTCTAGA GTCTCATCTA 
155751 AATCAGATAT GAGTAAGATC AAGGTATAAT GCATCCTGAG GCAAGTTTTC 
155801 TGTAAGCCCA TGAAACTAAA CAAGTTTCAT ATATTTTCAA AATACAACAG 
155851 TGGGACAGGC ATAGGAAAAA CATTCCTGGC TGGGTTTGGT GGCTCATGCC 
155901 TATAATTCCA GCACACTGGG AGGCTGAGGT AGGAGGATCA CTTGAGGCCA 
155951 GGAGCTTGAG ACCAGCCTGG GCAACATAGT GAGACCCTGT CTCTACAAAA 
156001 ATAAAAATAT AAAAATTAGC TGGGTGTGGT GGCACATGCC TATATAGGTC 
156051 CCAGCTACTC AGGAAGCTGA GGTGGGAGGA TTGCTTGAGC CCAGAAGGTC 
156101 AAGGCTGCAG TAAACCATGT TTGTGCCACT GCACTCCAGT CTGCTAAGTG 
156151 ACAGAGCAAG GCCCTGTCTC TCTAAAAACA AAACATTTCC ATTTTAAGAG 
156201 GAAGAAATAA ACAAGAAGAC AGGAATAACT CATCTCCAGT AAGTCCAAAA 
156251 CCCAAGAGGG AAAACAATAT TAAATCTTAT GACACCAGAA TAATCTCCTT 
156301 TGACTCCATG ACCTGAATCC TGAGCACACT GGGGTAAGAG TTGGGCCTGC 
156351 AAGGCCTCAG GAAGCCTTGC CCCTGTGGGT TTTCTTGGCT CAGTCTACCG 
156401 AGCACCTCTC ACAGGTTAGA GTATCATGCC TGCAGCTTAT CAAAGCTGGA 
156451 GTTGCCTGCT GGTGGCCCTA CAGTTCTGGG ATCTTGGGGG CTGCCCCACT 
156501 CTCATAGTTC CACTAGGCAT TGCCTTACAG GGGACTCTTT GTGGTGGCTC 
156551 TGTCCCCATG ACAAGTCTCT GCTTGGGCTC CCAGGCTGTC CAAGATATCC 
156601 TTTGAAATTT AGGTGGTGGT TGCCATGGCC CCACAGCTCT TGCATTCTAC 
156651 AAAACCTGCA GAATTAGAAC . CACATGGATA CTGCCAAGGC TTTTGGCTTA 
156701 TACCTCCTAG AGTGGTGGAT TGAGCTGCAC GTGGGCCTAC TTGAGCCACA 
156751 GCTGGGGCAG CCGAGGAGCG CTGCACTGGA ATTTGGGGAG CAGAGTTCCA 
156801 ATGTGCAGGT CCTCGAATGC TGAGGTCCTG CAGATGTCTC TCTGGAAACC 
15 6851 TTGACCTCAA GGTTTTAGCT TGCCTTGAAA ATCTCTGAAA TGCCTTTGGG 
156901 GTCATTCTCC TATTGTCTTG ATGAACAGAA CTTGGCTTCC TTCTAGTCAT 
156951 ATTAGCGTCT TTAGCTAATG GTTGCTTGGC TACACCCTTA GTGTCCTCTT 
157001 CTAAACAAAA TTTTTTATTC TTACATGGTC AGGCTGAGAG TTTTCCAAAT 
157051 CTTTCCATTC TGCTTCCCTT TTAATTATAA GATCTGCTTT TAAGTCATTT 
157101 ATCTTTTATC TCATTTTATT GTAAGTGGCA AAAATAAACC CTGCAGCACC' 
157151 TGAATGCTTT GCTGCTTAGA TGTTTCTTCT GCCAGATATT GTAGTTCATT 
157201 GCTCTTATGT TCTACTTTCT ACAAAGTCCT AGGACATGGG CATAATTCCA 
157251 CCAAATTCTT TGCAATTCTA TAACAAGGAT GGCCTTTACT CCAATTTTCA 
157301 ATACTTTTTT TTCTTATTTT TATCTAAGAC ATCATCAGAA AGGCCTTTAC 
157351 TCTCTGTATT TCTACCAAAA TTCTGATCAC AACCATGTAA GTAATCCCTA 
157401 AGATTTAGTC TCTCCCTATA GCCTTTGTCT TCTTCTGAGC CATCACCAGA 
157451 ATCACCTTTA ACGCTCTGTT TATGGCAACC TAGGCTTTTT TCTAGGCTGC 
157501 TCTTCCAAAT TTTTCCAACC TCTACCCATT ACTCAGTTCC AAAGCCACTT 
157551 TCACATTTTC AAGTATTTGT TATAGCAACA ACCCTCCTCT TAGTACTAAT 
157601 TTTCTGTCTT TTTCCCTTTT GTGCAACTGT AACAGAATAT CTGAGACTGG 
157651 GTAATTTATA AAGAATAGAG ATGTTTTTCT TCACAGCTCT GGAGGCTGGG 
157701 AATTCCAAAG TCAAGGGGTC TACATTTTTT TGAGGGCCTT TTTAATGTGT 
157751 CATCCCACGG TGGAAGGCAG AAGGGCATGA AAGCACGTGA GCAAGAGAGG 
157801 GAAACCACTC CTGTGATAAC AGCATTTGTC CATTTGTGAG GGCAGAGCCC 
157851 TCATGACCTA ATTACCTCTA AAAAATCCCA CCTTTCCACA TTGTTCCATT 
157901 GGGGAGTATG TTTTTAACAC ATAAACTTTG GGAGACATTC AAACCATAGC 
157951 AACATCTATT TTTATCACAA CCCACACCTG CTCTGTCAGC ACAGCCTGTA 
158001 GGGCCAGGTT CTAAATATAA CTAGAATTCA TTTCGTTTTC ACCAACCTCA 
158051 CTAGTACAAC AGGTCCTAAC TGTTCACCTA GGTTACTCTC AACCATCTAT 
158101 TTTGTTAAAG AAGTCAGGAA GCATGAATCA GATTATTTAA ATCCTCTGCT 
158151 CACCAGCCTC TAATGCTTAT TCATTGCATT TCTGAGTAAA AACCCAAATC 
158201 CTCACTATGG TCTGTCTGAT CTGGACCACT TTAAACTTCA CCCCACACTC 
158251 TAATGCTCTC TCTGTTGTCA GGCCTTTCCT GCTATGCCTC TATGTAAACC 
158301 AGGCAGACCC TTCCCTGGGG TTCTTTGGGG CTTTTTAAAT TATTGTTTCC 
158351 TCTGCCTGAA ATGTTCTTCT CTGATAGTCA CATGGCCAGA TCCACCCTTC 
158401 AAATGTCATG TTCTCAATGA GGAACCTCCT GGCTATTCTA TCTAAAGTTT 
158451 TCCACAGCCA ACAGTTTTTA TCTCACATCC TGTGTTTTTC TCTTAGTTAT 
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158501 ATTTTATTAT CTAATATACT GTATGGACTG AATTGTGTCC TCTCCCTGCC 
158551 AAATTCATAT GTTGAAGCTC TAATCCCCAG TGTTAAAGAT AGGACTTTAA 
158601 AGAGATAACT AAGGTTAAAA TAAGGTCATA AGTGTGGGAT TCTAATTCTA 
158651 TATAATTGCT GTCCTTGTAG GAAGAGGGAG AGATACCAGG AGCACACACA 
158701 CAGAGAGAGA ACACCACTTG AGGACACAGT TGAGACGACG ACCCTCTGCA 
158751 AGCCAAGAAG AGAGGCCTCC CCAGAAACCA AACCTGCTGA CACCTTGATC 
158801 TTGGACTTCC AGCCTCCAGA ACTGTGAGAA AATAAGTTTC TAGGTTTAAA 
158851 TGACTCGTCT ATGGGGCTTT GTTCATGGCA GTCTGAGTTG ACTAATGCAT 
158901 ACGCTAACCA TTTCATGTAC TTATTTTGCA TACTGCCTGT CTTCCTTTTA 
158951 GAATGTAAAC TTCACCTGGA GAGGAATTTT TATCTGTTTT TTTCACCACT 
159001 GGAATTCTTA GTACTGATAA GGTGCCTTAG CTGTAGTAAG AATCAGTACT 
159051 ACTTATTGGA ATGAATAACC AATAGGTGAG AAACAGTCCT GGTTTACACT 
159101 GCTTTCTTTT ATCTCTGTGG TAGACTAATT GCAAAAATGG CTCTAACTGT 
159151 TCATTGCTCT TTTACATGCC CTTTGCCATG TGTCTTTGCA GCTCTTCCTA 
159201 TACXAAGGTA GAATCTGTTT TTTTTTTTTT TTTTTTTTTT GAGATGGAGT 
159251 TTCACTGTTG TCACCTAGGC TGGAGTGTAG TGGTGCCATC TTGACTCACT 
159301 GCAACCTCCG CATCCCGGAT TCAAGAGATT CTCATGCCTC AGCCTCCCGA 
159351 GTAGCTGGGA TTACAGGCGC CTGCCACTAT GCCCAGCTAA TTTTTGTATT 
159401 TTTAGTAGAG AGATGAGGTT TTGCCATGTT GGCCAGGCTG GTCTCGAACT 
159451 CCTGATTTCA GGTGATCCAC CTACCTCAGC CTCCCAATGT GCTGAGATTT 
159501 CAGGCGTGAG TCACCATGCC CAGCCCAAAA GTAGAATCTT TATCCTCTTC 
159551 CTTTGAATGT GATTTGGCTT TTTGACTCAC CTTTGCCAAT AAAATATTGC 
159601 AGAAGTATCA GTGTGTCAGT TCCAGACCTA GGTGTCCAGA GGCCTTACCT 
159651 GCTTCCATTC TCCTTCTTAG AACTCTTTGT CATAAAAGCA ATCCTAGGCT 
159701 AGCCTATAGA AAGACAAATG ACCATGTGAA AAAGAGAGAA ATTATCCTAG 
159751 CTAAGATAAT TCTAAACCAG TCAGCCTCCC TGCTAGTTGA CCACAGACAT 
159801 ACAATGAGAC AGGATTACCT TAGCCAACTT TAGCCAAGAC CAGTAAAGCT 
159851 GCCTAGCTGA CCCACAGATT GGCGGGCAAC AATACATATT TATTATTTTA 
159901 AGCCAGCAAG GTTTTGCAGT TTTAGCTATT ACAGCATTAT AGCAGCAATA 
159951 GATAATTGTA TCTGACAGTG GTCCCAAAAA GTATTTTGTG TTTCAGAGAT 
160001 GTCTCTTCTC TTTAAGTGGC AGACTAACTT AAAAGAAAGC TTATGGGCTC 
160051 TGGAACCAGA TTGAATTGGG TTCAAATCCT AGCTGCAATC ACTTACCATA 
160101 TGACCTAGTG CAAGTTACTT AACCTTTTGA CCTTCGCATT ATTTGTAAAC 
160151 , T AAAG AT AG A AATACTTGCT CAGAGCTTTT GTGAGACTTA AAAGAACTCT 
160201 CATGTGAAGT ACCTGATACA ACTACTGATA TTGATCTAAA AAACATCACA 
160251 AATTTTCCCT TATAACTACT CTGTGGCTGA ATCATCAATA GATTAATCTA 
160301 AATGCCTGTA GAAAGTCTAC TTTTTGTCTT ATTAGACAGA AATTATTAGG 
160351 CATCAGCCTG GTATACTTGC TATTGACTCT TCAAAGCTGT ACTTTCTTTT 
160401 AGTCATTCTA ACTGACAGTT TTCAACTTTA AGGGATGGCC TTAAAGCCGA 
160451 TAAAGTCCAG GTTTTAGTGA CTCGATTTAT TTCTACTGTA TTTGTACCCT 
160501 TCATGATGTC ATTGAGTTAG GCCAGAAGGC TTCTCAGTTT TCTATATTTG 
160551 AACAAAAGAG GCCTAATTTC CAAACATCAA CATAGAGGAG AGTGAATGCT 
160601 CTTGATTCAG TGCCATATAC CTGGAAAATA ACTGAGGTAT TCCCAGCAGC 
160651 TGCTGGTAGC CTCCGATAAG TCCAAGTGAA ATATGGGCAT CTAAATGCAA 
160701 AGACAGAGGC CCTGTCCATT CAAAGTCCCA CAAATCAGCC TGTAAATATG 
160751 TAGGCAGCTA TGTGTTTGAA TATTTATTCT TGTATTTTAT GCAGCTTCTT 
160801 CAATAAGAAT TAGAGAAGCA GGTTAAATTT GATGCAATGT CTTAGGCATT 
160851 AGATTCAATT TAGAGAAATA CATGCTAAAG AGAAATAAGG TTTTTTTAAA 
160901 AAAATTTACT TTATTTTTTA ACAGGAAATT AGCTTAAGCA CTGCTTTCAG 
160951 GTAAAACCAT TCTATAACTG ACATCATTTT TAAGTGCTCT ATATTGATTA 
161001 AACACTTTAG CTAGATTTGT AAGTCTGATC AGAAAATATG AGTTGTAAAT 
161051 CACGATGTGA TTTTTCCTTT GATCCCATTT AAACATTTTC TCCTTTGGAT 
161101 CACATGATAA CTTCTGATAA AATTCTGGAA ACTAAAATCC ATTTAATTGG 
161151 TAATGATGTT TCTTGTGAAA ACGCTTAACA CACAATTTTG TTAAAATGCA 
161201 GGAACATGAA TGTATTGGTA TCTGCTTTTC TCAAGAATAA AATTGATTTC 
161251 TTCATACTAA TCATCTCTTC AATAAACTGT CCCTCTTCTA GTCAAGTCAT 
161301 TAAAATGATT CTTTCTCAGA CACTTTTTGA TCTAAGTTAT AACTGTCATT 
161351 TTAGAAAGAA AATTTCAAAA TGATTAAGAT TTTTTGTTTG AAAGGGGCTT 
161401 GATTTCTCCT TAACACTTTA CTAATAGAAT ATGAAGTGAG GATCCCAAGG 
161451 CAGCTTACAT ATTTCTGCAG TTTTGTTTAT TATATTTTCT GAATTCAGAT 
161501 AAATCTGAAC AGGCACCTAT AAAACCTTTA AATATAAGAA CATTCAGTAT 
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161551 TTGGAATACA ACACCACATT TTATTGTGGC AATGGAGACA TTGTAATGGT 
ISlitl SSSSS ATCTCTCCAG GCCTGGTGGA CAACTGAGAA SSSiSS 

161651 TCGTAGAGTG gatgagtgca gaaagccatt tcgtaaggtg atatttgatg 

iSlei CCTGCATTAC ATGGGACTCT ACATTATTTC TCTCTTCTGG GTCTAGATTT 
161751 CATAAGCCAC TGCTTTCAAG TAGACTGAAG TTGGGAACAT ATTATCgIS 
itllll CTAATATTCT ACTAAATATA AGTACCATCT « 

JSI™ AACACAGQTO AAAAGTTGGC ATTCATCCTT TTAAGTGTCC TCCTTTCACA 
161901 GGTGAAGTGT GTAGCCTAAG GTTATGCAAC TACGCAGTAA AATTAGCAGT 
161951 AGATCCCAGG TGTTCTTATT CCCTGGACTG GTGCTTTTTC TACTCTATgI 

162001 AAGGAAAGGT agttttgcta gtgggtaatt tggagcaagg agactcctS 
162051 gtgaaggagt aaagaggttg gagggagagg cagggcagtc tgctcttctc 

162101 CCACCCCTAG TGTTGCCTAC GTTTCTGCTC TAGGGATGTA TGTTTAACAA 
AAACATCAAA GGTGGACTTG ATCAATGGAC TAGAGAGCTG AAAGT^AT 
"^01 AGAACTTTGG CAACCCAAGG GAGCTACAGA AAGAGGAGGG TTTGTAAGCT 
162251 AATGGCAATT CTGTTTTATC TCTATCATCT TACCTCACAG CAGTCATAGr 
1 235 0 i ^CATCCAC CAACCCCACA CAcSgS 

itllm SSS^ TATCTATAAC TATGCCCTAG GTAGCCCACT CCCAAGCTTG 
itlHi GGGAAAATAT TCCCAGTTAG AACATTTTGG GTTGCCCTTA , 

162451 TCTTCAGCAA GTCTCAATCC CAATGTCAGT TCAAAGTATC AGAGATAATT 
162551 1 !S2S225 AGrrTAGGCA ^TATAAGGA ATGAGCAATG 

™^ AATAATCCTA TTTTTGTCAG AGAAAGATAA ACCAGATGGA 

llltli iSSSS aggcaagtaa tattotaaaa atcattataa cacatatagt 

162651 AAAAATAAAG TTCAGACTAT GATACAATTG TAAAATAAAA ATGAGTCTTT 
162701 TTCTTACCCT GCATTCCAGA GGTAACCATC ATTTACTTTC TTAGGCATCC 
162751 TTCACCAAAT GTTCCATGTG TATTCATGCA TCTATGTATC ATCTTTCTGT 
162801 CTGTCTGTCT GTCTGTCTGT CTATCTATCT ATCTATCTAT StcSS 
162851 ATCATCTTTT CTTTTTTTTT TTTTTTGTTT TAAGATGGAG TCTTGCTCTG 
162901 TCGCCCAGGC TGGAGCGCAG TGGCACGATC TTGGCTCACT gSSScc? 
162951 CCTGCAATCT CTGCCTCCGG GTTCAAGCAA TTCTCTTGTC TCmS5 
163001 CTCAGCCTCA GGCATAGCTG GGATTACAGG CGCAGTTCAC CAtScIgC 
163051 TAATTTTTGT ATTTTCAGTA GAGACAAGAT TTCACCATGT TGGOCaSS 
163101 GGTCTCAAAC TCCTGACCTC AAGAGATCCA CCCACGCTGG ACTCCCAaS 
163151 TGCTAGGATT ATAGGCATGA GCCACTATGT CTGGCCCTAT TTTTTTAAAA 
163201 AATAAAAGCA AATGGGAGCT TACTATTCCC ATGTTTTTTT TTTtStwS 
163251 ACTTTAAGTT TTAGGGTACA TGTGCACAAC ATGCAGGGTT GTTACATATG 
163301 TATACATGTG CCATGTTGGT GTGCTGCACC CATTAACTCG TCAtSSca 
163351 TTAGGTATAT CTCCTAATGC TATCCCTTCC CcSSSS ScSctS 
itHll 5™*™ CCCCTTCCTG CGTCCATGTG iS££££ 

163451 TCAATTCCCA CCTATGAGCG AGAACATGCG ATGTTTGGTT TTTTTGTCCT 
163501 TGCGATAGTT TGCTGAGAAT GATGGTTTCC AGCTTCATCC ATGTCCCTAC 
Itltm JSSS^* RACrCATCM! TTTTTATGGC TGCATAGTAT GACATGGGGT 
163601 ATATGTGCCA CATTTTCTTC ATCCAGTCTA TCATTGTTGG ACATTTGGCT 
163651 TAGTTCCAAG TCTTTGCTAT TGTGAATAGT GCCGCAAtS JSSS55 
SCATGTGTCT TTATAGCAGC ATGTTTTATA ATCCTTTGGG tSSS 
163751 GTAATGGAAT GGCTGGGTCA AATGGTATTT CTAGTTCTAG ATCCCTGAGG 
163801 AATCGCCACA CTGACTTCCA CAGTGGTTGA ACTAGTTTAC AGTCcSS 
"3851 ACAGTGTAAA AGTGTTCCTA TTTCTCCACA TCCTCTCCAG ScSottoJ 
163901 TTCCTGCGCC ATTCTAACTG GTGTGAGATG GTATCTCATT GTGgSSK 
163951 TTTGCATTTC TCTGATGGCC AGTGATGATG AGCATTTTTT SSSSJ 
164001 TTGGCTGCAT AAATGTCTTC TTTTGAGAAG TGTCTGTTCA TATCcSgC 
"JJS CfCTTTTTG ATGGGGTTGT TTGTTTTTTT CTTGTAAATT SSSSSS 
164101 CATTGTAGAT TCTGGATATG AGCCCTTTGT CAAGTGAGTA GATTGCAAaI 
164151 ATTTTCTCCC ATTCTGTAGG TTGCCTGTTC ACGCTGATGG TAGTTTCTTT ' 
1-64201 TACTGTGCAG AAGCTCTTTA GTTTAATGAG ATCCCATTTG S2 
164251 CTTTTGTTGC CATTGCTTTT GGTGTTTTAG ACATGAAGTC CTTGcSSg 
164301 CCTATGTCCT GAATGGTATT GCCTAGGTTT TCTTCTAGGG TTTTGATGGT 
164351 TTTAGGTCTA ACATTTAAGT CTTTAATCCA TCTTAAATTA SSS 
164401 AAGGTGTAAG GAAGGGATCC AGTTTCAGCT TTCTACATAT AGCTAGCCAC 
164451 TTTCTCCAGC ACCATTTATT AAATAGGGAA TCCTtScS 
164501 TTTGTCAGGT TTGTCAAAGA TCAGATAGTT GTAGATATGT GQcSS 
164551 CTGAGGGCTC TGTTCTGTTC CATTGGTCTA TATCTCTGTT TTGoSS 
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164601 TACCATGCTG TTTTGGTTAT TGTAGCCTTG TAGTATAGTT TGAAGTCAGG 
164651 TGGCGTGACG CCTCCAGCTT TGTTCTTTTG GCTTCGGATG ACTGGCAATG 
164701 TGGGTTCTTT TTTGGTTCCA TATGAACTTT AGAGTAGTTT TTTCCAATTC 
164751 TGTGAAGAAA GTCATTGGTA GCTTGATGGG GATGGCATTG AATCTATAAA 
164801 TTACCTTGGG TAGTATGGCC ATTTTCACAA TATTGATTCT TCCTACCCAT 
164851 GAGCATGGAA TGTTCTTCCA TTTGTTTGTA TCCTCTTTTA TTTCATTGAG 
164901 CAGTGGTTTG TAGTGCTCCT TGAAGAGGTC CTTCACGTCC CTTGTAAGTT 
164951 GGATTCCTAA GTATTTTATT CTCTTTGAAG CAATTGTGAA TGAGAGTTCA 
165001 CTCATGATTT GGCTGTCTGT CTGTTATTGG TGTATAAGAA TGCTTGTGAT 
165051 TTTTGCACAT TGATTTTGTA TCCTGAGACT TTGCTGAAGT TGCTTATCAG 
165101 CTTAAGGAGA TTTTGGGCTG AGACAATGGG GTTTTCTAGA TATACAGTCA 
165151 TGTCATCTGC AAACAGGGAC AATTTGACTT CCTCTTTTCC TAATTGAATA 
165201 CCCTTTATTT CCTTCTCCTG CCTGATTGCC CTGGCCAGAA CTTCCAACAC 
165251 TATGTTGAAT AGGAGTGGTG AGAGAGGGCA TCCCTGTCTT GTGCCAGTTT 
165301 TTAAAGGGAA TGCTTCCAGT TTTTGCCCAT TCAGTATGAT ACTGGCTGTG 
165351 GGTTTGTCAT AGACAGCTCT TATTATTTTG AGATATGTCC CATCAACACC 
165401 TAATTTCTTG AGAGTTTTTG GCATGAAGTG TTGTTGAATT TTGTCAAAGG 
165451 CCTTTTCTGC ATCTATTGAG ATAATCAGGT GGTTTTTGTC TTTGGTTCTG 
165501 TTTATATGCT GGATTACGTT TATTGATTTG CATACATTGA ACCAGCCTTG 
165551 CATCCCAGGG ATGAAGCCCA CTTGATCATG GTGGATAAGC TTTTTGATGT 
165601 GCTGCTGGAT TCGGTTTGCC AGTATTTTAT TGAGGATTTT TGCATCGATG 
165651 TTCATCAGGG ATATTGGTCT AAAATTCACT TTTTTTTGTT TGTGTCTCTG 
165701 CCCGGCTTTG GTATCAGGAT GATGCTGGCC TCATAAAATG AGTTAGGGAG 
165751 GATTCCCTCT TTTTCTATTG ATTGGAATAG TTTCAGAAGG AATGGGACCA 
165801 GCTCCTCCTT GTACCTCTAG TAGAATTTGG CTGTGAATCC ATCTGGTCCT 
165851 GGACTTTTTT TGGTTGGTAA GCTGTTAATT ATTGCCTCAA TTTCAGAGCC 
165901 TGTTATTGGT CTATTCAGAG ATTCAACTTC TTCCTGGTTT AGTCTTGGGA 
165951 GAGTGTATGT GTTGAGGAAT TTATCCATTT CTTCTAGATT TTCTAGTTTA 
166001 TTTGCCTAGA CGTGTTTATA GTATTCTCTG ATGGTAGTTT GTATTTCTGT 
166051 GGGATCGGTG GTGATATCCC CTTTATCATT TTTTATTGCA TCTATTTGAT 
166101 TCTTCTCTCT TTTCTTCTTT ATTAGTCTTG GTAGCGGTCT ATCAATTTTG 
166151 TTGATCTTTT CAAAAAACCA GCTGCTGGAT TCATTGATTT TTTGCAGCAT 
166201 TTTTTGTGTC TCTATTTCCT TCAGTTCTGC TCTGATCTTA GTTATTTCTT 
166251 GCCTTCTGCT GGCTTTTGAA TGTGTTTGCT CTTGCTTCTC TAGTTCTTTT 
166301 AATTGTGATG TTAGGGTGTC AATTTTAGAT CTTTTCTGCT TTCTCTTGTG 
166351 GGCATTTAGT GCTATAAATT TCCCTCTTCC CATGGCTTTG CACCTTGGTC 
166401 CAGCGTCTGG CTTGTGAAAG ACACTCAACG ATGGGCCCGG CGAGGTAGCT 
166451 CACGCCTGTA ATCTCAGCAG CTTGGGATGC TGAGGCGGGC AGATCACAAG 
166501 ATCAGGATAT TGAGACCATC CTGGCTAACA TGGTGAAACC CCGTCTCTAT 
166551 TAAAAATACA AAAAATTAGC TGGGTGTGGT GGCAGGCGCC TGTAGTCCCA 
166601 GCTGCTCGGG AGGCTGAGGC AAGAGAATGG CATGAACCCG GGAGGCAGAG 
166651 CTTGCAGTGA GCCGAGATTG CGCCACTGCA CTCCAGCCTG GGCGACAGAG 
166701 CAAGACTGCA TCTCAAAAAA CAAAACAAAA CAACAACAAC AACAAAACAC 
166751 TCAATGATGT TATCCATCAC TATTACCTGT GGATTTTGGC TATGTGTCTG 
166801 AATCTTCTTT CTGATCTAAG ACAGCCGTTG TGTCCTGTAT TGTGTTAATT 
166851 TCTCTCAGTC GCCTGTCCAT CACTTACTTC CAACAGTCAT AGTATTCGAC 
166901 CCTTTATAAA AGCGAACCTT GGCATGCTTT AAAGCAGTGG TCTCAGCCCT 
166951 GACTGCACTT TAGAATCACG TGGGGAGCAT TTGAAAACTA CTGATGCCTG 
167001 GGCCCTAACC CAGATTAATT AAATTAGAAC TTCTGAGGGT AGGTCCCAGG 
167051 CATAGCTGGT CTTTTAAAGC TCCCCAGCAG ATAGTAAGGT ACAGCCAGGT 
167101 TATGAAACAC TGGTCCAGAG ATTCAGCTTG TAAGAATCAC TTTGGATCCC 
167151 AATTCTGGTT TGTAGTGTTT TAGCAGTGTT ATGCATAAAT TTGATAACCA 
167201 TGCCTGCTGA TTTCTTCTTT CTTGTCATGG GCAAGAAAAT CTCTGTTAAT 
167251 TGTTCCTTTT TGTTTTTTGT TTACTTTGTA ATTCAACTAT TTTTCTCTAA 
167301 AATTAAATAT TAGAAGTTCT AGTCAAATTG AAGGTTTTGG GGTGAATTTC 
167351 ATTCTGTACA ACTGAGGCTT TATGTATTAA GAAAACAGCA TGATTTCAGG 
167401 TGGTCAATGT GCTTCTATGG CCATAGCATC AGTTTCATTT ACAATGATTC 
167451 CTCTTGATCT CTCTTTCTCC TTCTCTTGAT GAAAATTTCC ATGCTAGGTT 
167501 TCCATCTTTT CACTGTTTTA CTCTGTCAGA CATATGCCTG CATAAACATT 
167551 ACCCAGCCAT GATAACCAAA TTAATGAAAT CCTTACCAGG GCTTTGGATC 
167601 ATTGGTCATT TTTCTAGGAC TAATAGATTT ACTTGCAGTG TATATTGTCT 
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GTGTATATTA 
AAAGTGAGTT 
ATTTTGGCAA 
CATATCTCCT 
GCAAGTTCTG 
AGCATATACG 
GGCTAAGGGT 
CACTGTTCAT 
ATGAAAAATG 
TGGGTCTTAG 
CAGGATTCCC 
ATTTACAATC 
TATCAAATCA 
GATTGTACAT 
ATAGTGATCA 
ATTTATTTGT 
CATATATTAT 
ATTTTTCCCC 
AAATCATGCC 
CCTGAAATGC 
ACACATTAAT 
AACAATAAAA 
AAATCCTCAG 
CATTCCTATA 
CAAACTAGTA 
GATGTTTCTG 
TCCAGTCTCC 
CATGAATATT 
AGAATTGAGT 
AACAATACCA 
TGGGAGTGTA 
GCTCCATGAT 
CAGCATACAT 
AAATGAAGGC 
TATATTTATG 
TACCACTACA 
AAAAAATAAT 
TAAAAGAGGG 
TCTAGGGGGA 
TATGTAGTGG 
GTATTTGGTT 
ACAAAAACAA 
TTTTGAGGTA 
TCTGCATACT 
GTCCAGCTGT 
AGACGTAGTC 
TATTTGATTT 
ATTGTATGCA 
TTTTAAAAAC 
ACAGCCTCCT 
ATTAGGATAT 
ACATAACCAA 
TTTTTAAAAT 
ATGTATCCTC 
CCATTAACCT 
CCTAGCTTTG 
CACTGTTAAT 
GGGGATGAGT 
AGAATGCTAG 
TGTGTTGATT 
CAAAGGTTAT 



TCTGTGl'AAT 
AAGAATATTT 
AGTTGTTCCT 
TTATCTATGA 
ACTTTCACAA 
TGAATGTTTC 
GACTGACAAA 
ACAGTGCCAT 
TGGCAGATGA 
CATTAGACTG 
TTGGATATCA 
AAGGGATCAG 
ACGTACCATT 
ATTTATGGGT 
GGTCAGGTAA 
GTTGGGAATA 
TGTTAATTAG 
TATATAGCTT 
ATTTTTAATA 
ATATATGTGT 
TAAAAGAATA 
GGAAAGTAAT 
GCAAACTCAG 
GTCATTTCTA 
CAGGGATGTT 
TCTAAGTGAA 
.CCTTTGTTTA 
AACCTCTTGC 
GTATCCACCC 
AGTGATAATA 
AATTGGTGGA 
CTAACAGCCC 
GTTCACAGAA 
AAGTCATTGT 
TAATATAGAA 
TGCAGTAACA 
CACAATAGAA 
AATCTAATCT 
AGAAAGAACT 
TCTGGGCGGT 
ATGTAATTAA 
AACAGAAGGT 
GAAATATTAC 
AAACACCTGT 
GTCCTAGGAG 
TAAAGAAAAC 
GCCTTTTAAA 
AGTCAAAAAA 
AAATGAAATG 
AGTTAGCTTT 
TGATGGCAGA 
AAGGTTTGAA 
ATTACAATTT 
AGCCATCCTG 
TGGTGTCTCT 
CAGAAAATGA 
TTAGTAGTTA 
GCTGTAGACC 
GAAAACTAAT 
TCAGGTAAGA 
AAATGACCAT 
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ACCAGGAAGA 
AACATGATTT 
GGAGGAGTCA 
TCTCATGGGA 
AATGCTGTTT 
ACAGTGGTAT 
TTATCCCTAA 
GTACTCTTTT 
CAACTGAGGC 
GATTTTTTTT 
AGTTTTCTCT 
TTTTATTGGT 
TTTCTTTTAC 
TACATAGTAA 
TTAGCATATC 
TTCAATATCC 
AGTCATCCTG 
TAATACAGCT 
AGAAATATTG 
GTATATGTGT 
ATTATAATGT 
TTGGTGAACT 
GAATAGATCT 
TCCATGGAAA 
GGTTTTGTTA 
CAACTCTTGG 
TGTAATGGTT 
AAACATATTA 
ATAAGAATGG 
TAGTACAATA 
GCAAGGTTGA 
TACTCCCAAG 
AATATGTGGA 
GTCTATCACT 
TATGCTATAA 
TGGGTGAATC 
TACATAATGA 
GTTGTGAGAG 
TGGAATGTTT 
GGTTACACAG 
AATATGTAAA 
TCATCTCTGT 
TTCGATTGCA 
ATTATTACAA 
GCAAACTATC 
TTCTCAAATT 
TTTAAATAAA 
TCAGTCTTAT 
ACACAACAAT 
TTTTTTTTTT 
AAGATTCATC 
AAATAATTGC 
GTTATGTGGA 
CCCTTTCTGG 
CCCTAGGTAG 
TCTGATTATT 
TACCTGGTTG 
CAAGTCTGCC 
ACAAAGAAAT 
AATCACAATC 
ATAGAACACT 
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CTTTCCATGG 
TTTTTTTTGA 
CATAGACAGT 
AGTCTTGAAA 
CTGACCATAG 
TTCAAGTCAG 
TTGCAGAGGA 
GGACTGATTT 
CTGAAAATGG 
TCCCCAAAAA 
CACTTCTTTA 
TGCAGCAGAG 
TTTTAAAATT 
TGTTTTGATG 
CATCATCTTG 
TCCTTCTAGC 
CAGTGGTATA 
TTAATTTTAC 
TGTAAAGTCC 
ATATGTATGT 
CTTAGATGTC 
TCTAATGTCT 
TAGCATACAA 
TACTCTGCAT 
CAGACTCAAA 
CAGTTTAAAA 
GGATTCCAGG 
CTGTTTATGA 
TTAAACAAAA 
GAATTCTCAT 
CTAAAGCTGA 
CATGTACACT 
GAAATGTTCA 
ATTAGAATAA 
CAATGAAAAT 
TCACAGAAAA 
ATGATGCCAT 
AAATAAAAAT 
CTAGAGTGTT 
GTGGTTCCAT 
TTTTACTTTA 
ACATGTCTAG 
AGACTTCTGT 
CTAAAATGTT 
CCTGCTGGAC 
TATTATTTTC 
TATATTTTTA 
TTGGCACGAA 
AAGCCAGTAC 
CTGGAATAGT 
AAGCTGAGAC 
AAGAATTCTC 
TGGTGAGCTG 
GATCTCTCAT 
ATAGATTTAT 
TTCTAAAGAG 
GCATGAGTGA 
TGCTAACGTG 
AAACAAGACT 
AAGAACTTCC 
GATTCACCTG 



GCAGCCATTT 
AAGAGAAAGT 
ATCGAGGCTT 
TTATAATAGA 
TCATTTTGGC 
ATTGGTGAGC 
ATTACTAAGG 
GGAAGGGACA 
ATGAAGGGGC 
TACTAATTCA 
CCTCCAAATG 
ATCTCAGCCC 
GACATATAAT 
TATACAATGT 
AACATGTATC 
TATTTGAAAC 
AAACACCAGG 
ATCCTTTCAC 
TCTTTACTTT 
ATAAATAACT 
CTACAAAAAG 
TTCAATACAT 
AATGAATAGT 
GTAGCAAATG 
GACCAAGAGT 
TATAATAGAA 
AAAAAACAAT 
GCAAATGCTT 
ACCCAAACCT 
ACATTGCTGA 
ATATTTGCCT 
TAGCAGCAAG 
TAGCATTCCT 
CAATTTGTGG 
GGACCAACCA 
AATGTTAAAC 
TTGAACAATT 
AGTGGTTAAC 
GGTCATGCTC 
TTGTAAAAAT 
AAAAACATTT 
TGGGACACTC 
CACCTTGATT 
GGCACAAAAT 
ACTAGGTTTG 
TCATCTTTAA 
AACAAATGTT 
TAGATATAAA 
TGTCACACAC 
GTTAACCAAT 
TTTCTAATTG 
ACTACAGGAC 
CTATACErGT 
GACTATTAAA 
AATCCCCTAC 
TGAACCAACA 
GTATACGCTG 
GCCACTGCAA 
TCTCTAATTC 
TCCATAGAGA 
CTCTCATTTT 
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170701 TTTTTTTTTT TGGTGCTCCC AATTTCAGAG ATGTATCTCA ACATAGTGAC 
170751 ATGATTCAAT ATGATTTAAT CCTGTATATG GATACCATAA AAATAATCAC 
170801 ATGAATTATC AGAATCAGGT GTAATACTTC TGTAACATGT TGGAAAGTAT 
170851 GACTCTAGAA AAAATGGTCT AAGTTCCTGT TATGGTTTAG ATCTGTGTGC 
170901 CCATCCAAAT CACATGTCAA ATTGTATTCC CCATTGTTGG AGGTGGGTCC 
170951 TGATGGGAGG TGATTGGATC GTGGGGGTGA ATTTCCCCCT TGGTGCTGCT 
171001 ATATTGATAG TGAGAGAGTT CTTGTGAGAT TTTTCTGTTT AAAAGAGTGT 
171051 AGCATCTCCC CTTCTCTCTC CTGCTCCTAC TTTGGCCATG TAAGACATAC 
171101 TGGCTTGTCT TTCACCTTCT GCCATGATTA TAAGTTTCCT GAGGCCTCCC 
171151 GAGAAGATGA GCAGAAGCTG TTATGTTTCC ACTTCCTGTG CAGCCTGCAG 
171201 AACCATGAGC CAATTGAACT TCTTTACTTT ATAAATTACC GAATTTCAGA 
• 171251 TATTTCTTTG TAGCAGTGTG AGAACAGACT AATACAGCTC CAAAACATAT 
171301 GGGAGAAATT ACTTCAGTAT TAGGTTAAAC TACATACAGT TGGAGTTTCT 
171351 GAAGATCAAA AATATTCAAC TGTTGGCAAT TTCATATTTT TCAAAATAAT 
171401 AGCAGCAATG GTCACGAATT TATTTATATA CAACCATAAT GCAGTGTGGT 
171451 CCTGCTGATG GAAGGAAAAT CAGACTATGT GTCATGATCC AGGAACAAGC 
171501 TCTACCGTTA AAACAGTGTA ATATTGGGCA AATTAGTTCT CCTTGGGTAC 
171551 AGTTTACTCT TGCATGTGGA GAAAACATTA ACTCGATTTA TGTTTCCCAA 
171601 ATGTGCACTA TCAGCCCCTG TGAGGCTGTG GCATTAATTA AATAGGTTAT 
171651 TGAATTTGTG TGCTCATCAA ATATTTTCTG TAGAAATAAA TTATATCTTG 
171701 CACATGTATA TTTGATTCCA TGATGATAAA TGAGTATAAA TCAATGCATT 
171751 CAATTTATAG TAGAATTTTA TAATATCATT ATCTATTTTA TAATTTATAA 
17.1801 GTTTTAAAAG TATAACTATG AAATTTAAAG AATAGTCTCA CTTTGTTGGG 
171851 GGTCTGTTTT CAGCGTTAAA AGATGTTTAA CGCTGAAAAC AGACCAAATG 
171901 CATGCCGGGC TTAAAAACAC GAAAACATTA AAAACTTAAA AACATGAAAA 
171951 ACATGAAGAT GTTTTTCCCA TCTTCATCTG ATAAAAAATA ATATTAAAAA 
172001 ATACTTTGAG CAACAAGGGC CAATTGGAAC TCAGAAGAGG GAAAGAACAA 
172051 ATCAAGGTTG AAGCCAGAAG TCTCTCTGTT TATTGATATG ATGCCTGGTA 
172101 AATACTTTCA TTTGGACAAA ATGGATCATG GAAAATAGAC TAGGGTTATA 
172151 CTCTCATAGA AGTTGGATAA GCTCAAGGTG ACTGAGATTA AGTCTAGAGA 
172201 GACAGAACAA GGCCAATCTG GAAGATAAAC ATAGGTGTGG CTTTTTGAAT 
172251 TTAGGATTGA ATAGAGTCAA ATGAAAAAGA AGTCAGTTCA TGCTTTGAGA 
172301 ATGTTATATG GCCAAAAGAA TCCCCAGTTT TGATTCAATA TGTTTGTGAA 
172351 TATTTAAAAC TTAATATGAT CTTTGGAGCT TCAGTATTCT ACAAAGTACA 
172401 TAGCAGACTG AGAAGGTTTT TCAGCCTTTT ATTAAGACTC TAGTGTTGTC 
172451 TTCATTTGGA ATCAAGGATA ATAAAAAATT AAGGGATTAC CAGAGAGAAG 
172501 AGCCAAGAAT TCCCACTGGG GCCTTCTAAT TCCCAAACCC TAGAATAGTG 
172551 GGTTTTTGCA ACATGATTTC AGAATTGTTT TAGACAAGTG ACGGTGGGTG 
172601 CCTCCAATTC TTTCTGTATA GGGTTGTTTA TTGCAATTCT TCTATCCCTC 

172 651 TTCTACCTTG TCTGTTGGTT GTATGAGGAG CATACAACTT GTCTCTTTAG 
172701 TTCAGAGATT TCCAAGAGAA GAGAAACCAC ATCTTCAGTC CAGATAGGAC 
172751 TGAAGGTACA GGAAGTCACA GGATTTGAGT CTTATACTGC AATTATATGA 
172801 GACTTTGTCT TTTTTTTCCA GGGAAAATGA ATGAATATAT TTTAGGGACA 
172851 TAAGGGAAGA AAATGTATAT TTATGATGCA GAAAATGATC TTTGGCAGAT 
172901 TGTATATTTC TTTCAATTAT CTGCTGCTCT CCTGCGAGAT TATTATCTTT 
172951 TCTTCTCCTT TAGAGTCAGG CTTAGTCATG TGACTTGCTA TGAGTGTTAT 
173001 CCTGTGAGTG TGCCATTTTT TTCTCTGCTA CCACAAGAAT AGCATAGCTC 

173 051 AGATAGAAGC TGTTGCTTAA GCCTGTCTAT CATTTATCCA GCCTATCATT 
173101 GACGGGCATT TGGGTTGGTT CCAAGTCTTT GCTATCATAA ATAGTGCTGC 
173151 AATAAACATA GGGAATATCA TACACTGGGG CCTTTTGGGG GGTGGAGGTC 
173201 AAGGAGAGGG AGAGCATTAG AACAAATACC TAATGCATGC AGGGCTTAAA 
173251 ACATAGATGA TGGGTTGGTA GGTGCAGCAA ACCACCATGG CACATGTATA 
173301 CCTCTGTAAC AAACCTGCAT GTTCTGCACA TGTATCCCAG AACTTAAAGT 
173351 AAAATAAAAA AGAAAAAAAA CTCATTGGCA GGACACACAT TAACATTATA 
173401 ATATTAAGAT ATTTGCATAG TCCATTAAGC TATACTTAGG ACTCATCTAC 
173451 ACAAGCAAGT CAAATAGAAA ATGGAAGCAA TTGCCTAACT AACTTCTGAT 
173501 ATCCCTTGCT TTAGTTATCT AGATATTTAA AGGGAAAAAA TTCTTAAATT 
173551 ATTCATTTTT ACTTTCTTGG ATTTCAAGCT TACTTAGGTA TGGACTTAAT 
173 601 TTCAAAGTTT CCTAATTTCC ACAAAAGACA ATTTGAAGTA AGATAACTCT 
173 651 TGGAGTGGAA CACCATATAA AAATGGCCTC TTTCTTTTCT CACCTGAGAA 
173701 CAGTATTCCA GTGAAATCCT GACTAACTAG AAATCCTTGC ACAGGGTTCA 
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173751 AAGCCACAAA TACATCATTT CATGAAATAG TCTTAAGTTA TCACACATCA 
173801 CTGCCAACTT CTGAAATAAT ACAGCAGGAC CATGTTGTAG TGTCTGACTT 
173851 GGAGATATCA TTTAATCGCT TTTATCTCCC ATATGAATTG TCTTGTTTGG 
173901 TGGAACTACC ATAACGACTC TTTTCTCTAT GAGTTAATGA GCCTGATGTA 
173951 CCAGGTTTTT TTGTTTGGTG GTAACTGTAA TTTGAGCATA CTCAAACTCA 
174001 ACTGCCCTTT ATTAAGGCAG AAGGTCTCTG CTCCACAGCT GAATCTGAAA 
174051 GCTGTTTGGG ATGATATTCT GTCCTTATAA TAACTTCTGG TATACAGAAA 
174101 TTGCCACTTG GCCCGTAAAA TGTTTATTGG AGGAAAAATG TAGACACACG 
174151 TATCAGTTTT TTTTCCCATC ATTCCTATTC GATTATTTTA ACTTAATTGT 
174201 CCCACCCCCT AAAAGGTAAT TTTCTTTCTG TTGGCTTCTT CACATTTTAA 
174251 AATAAAGTTG TAGCAAAATA TGTTCAAGAA TCCCTGGCTG TCCAGAGAAC 
174301 AAAGTTTTTA TGACTTATTT TCTGAATGAG AAAATAAGAG AAAGAAAATA 
174351 AAAGAAAGAA GGAAAGCATA TTGTGTCTCA AAAACAGAAC AAATTTTCCC 
174401 TAGTGTTTGA GTATAATTTT TTTTATTATA GTTGTGATTC AAATGGTGAT 
174451 GGTGAAAACA GGGTCACTGA GCTTCAAGAA AGCCAAGTTC TGCTTTGAGT 
174501 TTCTCAACTA GCTAAGGGCA TAACCTGGGG CAAATCTTTT CATCTTTTAA 
174551 TTTCTTTATG TACAGATTAA AACCTCTAGA CCAGTACTGG TTATCCCTGT 
174601 TTTTAGGTGC AGCATTAGTT ATCTCTGTTT TCTGTTTCAG GTCACCTAGG 
174651 TTTCAAAAGA AATCACTTCA GGAAGAGGTA TCATTCAGTG TAAACAAGGG 
174701 GCTTATCCAG TGTATGTACC AAGAGACAGA CTGTCTTCAT GAGTATCAGA 
174751 GGTGTTTCTG TCAAAACTGC CAAAAAATTT AGAATATAAA GAAAATCTAA 
174801 CTATTTTCTA AGTATAAGGC AGTGTTGAAT GTTCCAAATG CTGCTGGGAG 
174851 TACATTGTTA CTTTAATTTT TTTTTTTGAG AGCTCAGCAA CCCTTGTTGG • 
174901 AAACCTCTTA AATAACCTTT TCCTGGCTCC ATTTTAATTT TGTTTTTCTC 
174951 AGCGCTAGAA TTGTCAGCCA CCAAGCATGA AATGTAAGGG TTCTTTTTTT 
175001 GTAAAACATT TTAAAGGAAA TATACCCATT CTGAACATTT CTGGAGAAAA 
175051 TGACCTCTTA CCTACAGAAT TACCAAGGCA TTTGTGGTAA AGATGTATTC 
175101 TTTTCTCCTG GAGCCAATTT GGTATATTTT GATTTATGCC TTTGACATGT 
175151 TTTAAAGGTT AGGAAAATGT TCAATGTGTT TCTGCTTCTA AAGTTAAAGT 
175201 AGTCACTTCC TCAGATATGG CTGCTTAATG TTCAATCCTC TTAAAAAATA 
175251 ATTTGAAGGC TGGTGTTGGT GAAGGAACAA GTGCAGTTTT TGCATTAAGT 
175301 GTATTAAGCA GAGCTGCTGC TGACACAAAT AAAATATTGT TTGGGGAGGC 
175351 TGAGGCAGCT TTAGGGAAAG AAGTAAGGCG AGGGTGGGAA GTGCTTGAAA 
175401 AGTCAAGTCC TTTCTCTTTT TCCCTTCTCC CTGTTGGGCT GAGGATCTTG 
175451 TTTGACATTC AGAAGATGGG TAAAATAAGA TGCACTCTGT AAGCCACAGC 
175501 TTTATCAGGA ATTTTCATTA TTCACTTGTG CAAATCAGGA AGAACCAAGC 
175551 AGGAAGAACC AAGGAGCCTA AGTGAGAGCT GTGCTCTTGG ATTTTTAACT 
175601 TCCTTACATT ATGAAAATCA ATGATGAGTT TAATTGAACT AACATCCAAA 
175651 CTGTACCTTG AGTTTGACTG GCATTGAGTC CAGTTATGAA GATGTGGTGA 
175701 CACAGAAGAG AAGAGCACTA GACCGTGCTC TGTCCCTTTG GGCTAATCCA 
175751 CACTAGGCAA GCTAAGCCAC TTTAGTTTTT TCTCTCTCTC TGGCTCTCAT 
175801 TTTCCCACTC TGAAAATGAA GGTTTTGGAC TAGTTGTTTT TTAGGGTGCC 
175851 TTTTGTGCCA CTTCTCAGTT TGTTTTTGTT TCTTTACTCA GCATATTCTT 
175901 CTTGTTTTCT TGACCTCTAG AAATTTCCTG ACATCCTGCC CTCTCATGTT 
175951 GGAAGAACCA TCTAGTGAAT GGTTACTTAT CGCTACTTTC TTAAGGTACT 
17 6001 TATTTTTTTG TCATTGTTGT GTATTCTATT TTTAATTTCA CTTTTAAAAC 
176051 CATAAAATTT TGTAATTATC AATTTATGTC AACATTAAGC TAGATTTACC 
176101 TACATTTTTA TCATGTCTTT GCTCTTTATT TCTTCTTATA TTTCAAGCTG 
176151 TCCATCTGTA ATTATTTTTC TTCTGCCTGA CATCTATGCC TGTAAACAAT 
176201 TTTCTTTAGC TAGGCTCCGC TACAAGTTTT AGCTTGCTTG AATATGTTTC 
176251 TAGTTTGTCT CACTTTTCAA AATCATTTTT GGATATAATA ATCTTGATTA 
176301 GTAGTTTTCT TTTAGGACAC TTGAAGAGAT. TCCACTGTCC ACTGTCTTCC 
176351 ATTGTTGTTC TAAATGTGTC ATCTGTCACT TATAAATGCT GCTTCTTTGA 
176401 AGTAATACCA CTTCTCTCTA GCTGCTTTCT CTTTGTCTCT CTCCCTCCCC 
176451 TTCTCTCCTT C1TAATCTGC ATTTTCATGA CAATGTGTTC AGATGTGGGT 
176501 TTCTTTTTAT TTATTTATAC TTATCGGGTT TCACCAGATT TGTTACATTT 
176551 GTGGATAGCT ATTTTTCATC ACTCATAAAC TACCTGAGCT CTCAGCTCTT 
176601 CAATCATTTT TTCTGCTCTA GTTTCACCTC TTTACTTCTA TAAATATGAT 
176651 TATTTGTCTG ACAGCAACTT TTCATTCAAT TCTTCATTCT TCATGCCTCT 
176701 TAATCTCTTT TGTCGTATTT TCCATCTCAT TGTCTCTCTG TGCTCCATCC 
176751 TGGGTAATTA ATCCTGATCT GCATTTCAGT TGTTCAGTTA ACTTCATAAT 
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176801 TGGGTTAAAC TACCATGACT TAATTAATGT GAACTGTGCT ATAAATATTT 
176851 ATGCATTGGT TTATGGTTAC ATCTTTCAGA ATTTTATTTT TTATTTAGGT 
176901 CTGTTCAGTG CCGTCTCTCC TGAAGGGTAA TAAATGGTTG AGGAGAATTT 
176951 ACTTTTGCAT CAGACTTTTG TGGAACACTT TGGGATTCCA GCTTTATGGG 
177001 TGTTGAAATT GTTCTACTAG GCAAAATTGT TTTAGCAGAG CTAGGGCTTT 
177051 GTTCCTCATG ACAAAAACTA AGGTTCATAT TTACCATGTT GGGCAAAATC 
177101 CTGCAGGCCA AACCAGTTTT GTGTTTCCAT TTACCTTTCT TTGTTCTTAC 
177151 TTTTCTTAAA TTTTAGTCAT TTTTAGATAT TGTCTGGTCT CAACACCTTA 
177201 AAGAATGTGT GTGTGTCTGT GTGTGTGTGT GTCTGTGTGT ACATCTTTAT 
177251 AATACTATCT TATGGTTTTA TATTGTATCT TTCACTTCTA GTTGTTTTTG 
177301 GTAGAGGATG AAAAAAGATA TCTAGCCCAC CATATTTCTT TCCAAAATAA 
177351 TTTGTCTCTC ATTTGTTGTT TGTGACTGGT CTCTACTGAA AGTCCTCCTG 
177401 ACTCACTAGC CTGGCTTTCT TCATTTCCAC TTCAGGCTCT ACTTTCTATT 
177451 TCTGGCTACA AGGCTCAGTC GTCACCCTAC CTCTTCATCT GCCTCTGCTT 
177501 CCCAGGGAAT TGTATCTAAT CCCAAGGTCT TACATATCAT TCATATGCTA 
177551 ATAAATTCCA ATTTTATATT TACAGTCTAG ACTCTCGCTG ATTCTCAATT 
177601 ATATTTTTAA TCATTTATTA AACAGTTTCA TTTGGGCATT TATTTGTTTT 
177651 CTCAAATTCA GAATGTTTAA AATGAAACTC TGTTTTATTC ATTCCAATTT 
177701 TATTTTTTCA TCTTAATTCC CCATTTCTGA AAATGCTGCC ACCATTTTCT 
177751 AGACTTGATT AGCCCTAAGA CCTGTAAGTG AGCCTTGATT TTTCCTCTTA 
177801 TTGATAACAC GTCTAGATCC ATCTGGAAAT GCTCTTGTTC CACTTCTCAG 
177851 ATAGACAGGT GCTCCTCAAC ATACAAGGGA TTATATCCTG ATAAAACCAC 
177901 TGTAAGTTGA AAACATCAAT GTCAAACACA TATACTACCT CATAAAACCA 
177951 TATTATAGTT GAAAAATTAT AAGTCAAACC ATTGTAAATT CAGATGCTAA 
178001 ACTTACTATG AGGCTACTTC CTGATAAAAC CATTGTAAAA TCAAAAAATT 
178051 GTAAGTCAAA CCATCATGAG TCACGGATCC CCTGTATCTG TAATCTGATC 
178101 ATTTTTTACC ATACCCACTC CTAACATCTT CATCCAAATC ATTGTGATTT 
178151 CCAGTGTGAT CCAGTGTGAA TTCTGCAATA GTCTCCCACC TCTGTCTTTG 
178201 TTATCTCACT TCCTTACCCA CCAATATAAA TAAATAACAT AATCAGATAT 
178251 AATATAATAA TATATTATAA ATGATATAAA TCAGATTGTT TCTGTCCCGC 
178301 ATTTATAACC TCAGATGGCT TACCATATCA TTAATATATA TATTTATATG 
178351 TATATATATT TTTATATATA TAAATTTTAT ATATAAATAT ATATCTTATA 
178401 TATATTTATA' TATAATACAT ATATATTTTA TATATATAAA TATATATACA 
178451 TATTTAATAT ATAAATACAT ATGTATTANN NNWNNNNNAT ATATATAAAT 
178501 TATATATAAT TATAAATATA GA7^ATAAATA TAAATATAAA TATATAAATA 
178551 TATATAAATA TAAATATATA TAAATATAAA TATATAAATG GTATAAATAT 
178601 ATAAATATAA ATATATATAA ATATGTATAA ATATATAAAT ATATAAATAT 
178651 ATAAAAATAT ATATATAAAT ACATATATAA ATATATAAAT AAATATATAT 
178701 AAATACGTAT ATAAAAATAT ATATTTTTTT TTACCTGCAA CAATATGCCA 
178751 ATTCAACCAC TTAATAGCTT TCAAAACTCT GACCAATTAT GTAATTTCGA 
178801 TGCTCTTAGC TTTTTTTCTT TTTAAAATTG TAGTAATGAT AGCAACTATC 
178851 TGGGGTGATG TGGGGGGAGT TGCATGTTGA AATTAACTTA TACATTTGAA 
178901 ACAATTAGAA AAGTCACTGG AACGTGGTAA GTGCCCTATT AATATTTCAT 
178951 TGCTATTACC TGAAAATATT GAATTTGTTT ACTTATTTGT TTAATTTCTG 
179001 TATACTTCAC TAAAATATAG GTCAGTAGCT CAAATTATGA TATTCCTCAT 
179051 TGTCCTCTAT TGTCATATCT ACTGCATAGC AAATTGGAGG TGCTCATTTA 
179101 AGACTTAATG CCTGAATAAA TTTGATCTCC CACTCATTCT TTTATTAGAC 
179151 ACATGTTGAG TAAGTGTCCA CTATGGGCAA GACGTTGTGC TAGCTTTTGG 
179201 AACTACAAAG ATAAATAAGT CTCCGTCTTT GAACTAGTGG GGCTAATTCT 
179251 GTCTTGAAAA GACAACTAAG CAGAGATCTC ATTGAGGTGC AATGTGGTAA 
179301 GTGCTCTATT AGGAGAAATC TCTGGTAATT CCTGAAGTCT GGTGATATGG 
179351 AACACCATTT GGAGAAATAC TTGTGCTTTC TTTTAGATAA TGCTTTACAT 
179401 TATAAGATCG ACTCAACATT TTGGCTTCTG GAAGTTATCA TGGATAGGTT 
179451 AGAAAGAGAT GTAGAATCTT" GGTTATCTAG CTCTCCTGGA TCTTATCAAC 
179501 TTTCTGCAAT GAATGGAAGG TGAATTGGTG AATCCACAGA CATAAGACTA 
179551 TTCCACTGAA TTTAATGAAT TGGAAAAAAT AGCCTGAAAC AAAAGGAACT 
179601 GCTATTTTGA GAGTGGCTGC ATTGTCCTCC ACAGCATAAT GATCATGGAA 
179651 GGTCAATACT TCAGGTTATG ATGAAGGTAC TTTTAGTTAA GCAGAGACAG 
179701 AGCAATATCC TTGAGAGAGT TTTTCTTTAA AGATGAAACA CCTTACTGAA 
179751 CTTCCACTGG AGGGACTTTG AAACATATTT TAAGTAAGAG GAAGAGGACA 
179801 GCAATCATGG TGTGAAGAAA CAAGCCAGAA TTTTGAAAAA AAAAAAAAAA 
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179851 AAGGAAGGAT TGACTTCTAG CTGCTTGTTA TATCAGGATT CTCCAGAGAG 
179901 ACAGACCTGA TAAGATGATA GATAGATGAT AGATAGATAG ATAGATAGAT 
179951 AGATACATAC ATAAAGGGGT GCTATAACAG GAAAATTTCC TCATGTTATT 
180001 ATGGAGGCTG AGAAGTCTCA CCACAGGCCA TCTGCAAGCT GGAGACCCTA 
180051 GGATGCCAGT TGTGTGGCTC ATCCAAGTCT AGAAGCCTCA GAACCAGTCA 
180101 AGTTGATGGT GTAATTATTG GTTTGATACT GAAAGCCCAA GAACCTGGAG 
180151 AACCAAAGGG ACTTCTGGTG TTAAGTCTTG CAATCAAAAG GCCAGAAAAC 
180201 CTGAAGTTCT GATATCCAAG GACAGGAGAA TAAGAATCTC CCAATTTCAA 
180251 GATGGAGAGA GAGGAAAAAA AAATCACATT TTCCCTGCTT TCTTCTTCCA 
180301 TTTGGGCCCA CAGTGGATGG GATGGTGCCT GCCCACATTG AGGGCAGATC 
180351 TTCCCCACTC AGTCCACCGA CTAACACACC AATTTCCTCT AGACACACCT 
180401 TTACAGACAC ACCCAGAAAT ATTGTTTTAC CAAGTCTCTA AGTATTACTT 
180451 AATTCAGTCA AGTTAACACC TAAAGTTAAT TAGAACAATT CCAACCCCTA 
180501 TCAAATGGGT ACCCATATTC ATTCCTCAGA CCATGCTTAA TCTCCAAGTA 
180551 AAGACAATAG TAAGGTAAAT AGTTTCACCT AACATGATAG AACCACCATA 
180601 TGAGCAGCCC AAAATGCACT AATCCCTTCT CAAGAAGAGG AGGTAAAGAA 
180651 AGACCTTGGG TAATGTTTAT TCTTCTCCTG AGATCGTATA ACTTAAATAC 
180701 TATGATATAA GGTTAATAAT AATAGTATAT CTTATGTTAG ATATTGAAGG 
180751 AATAAGGGAG AAAGAAAAAC AAAGATACAT GCTACACACA TACACACACA 
180801 CCCACACACA CACTCAATGT ATTCTTAACA AAATAGGGAA GAAATGCTCA 
180851 TGACAATTGC AGTCCTTGTT TCCATAACTG GTTCACCTGG TTGTAATGTA 
180901 AGGTATTTAT AATTACCTTC TTCCACTACT TATTCTGTAT TCTCTGTGTC 
180951 TTCAGCAAGC ACCTTAATTA GTTGTGATTA TTTACCTAGT GGGGTGACCA 
181001 AAACCTTTGT TTCTGAAGGG TCTGGGTCAT TCGTAGTCCT TCCTGGATTG 
181051 GGTTGTTACA GTTTCCCATT GACCTTAATC ACAGAGTATG GTAAAACAAA 
181101 AATATGACCT AAGGAATCTC CTGTATTCCA GACACACTCT TCCTTACCTC 
181151 CATCGCGGAG TAGTAGTTCA ATTTCTTCTT GGTAGTCTGA AACAATCACC 
181201 CCAGCCAACA CTGTTACTTC TTTCTTAACT TTTTGACTCA AAGGCACGAG 
181251 GAACTCAAAG TGGCTGGGTG CCAGTCTTAA CTTCCAGTTC CATAAAATTA 
181301 TTGTTATGTC TTCTGGTGGC AGCATTTCTC TCTCTGGAAG TAAGACATCT 
181351 AATCTATAAA GTCGTGTAAA GTTGCAGGAA TAGGATACAA AAACTTTGCT 
181401 AGTGGGTCAC TAGGGATAAC GGTGAGTTCT GTCATGCCCA TGTCTCCCTT 
181451 TGATTCCTGG ACCATAAATC TTGACTATGC CATACATTTG ACACTGATTC 
181501 AGAGTATATG CAGCCTTTTG GAGAACCTTG CCCCAACCCT GCAAAGTATT 
181551 ATCATTTGGC TGGTGCTGTA ACTGCAACTT TAATAGGCCC TTCCACTGTT 
181601 CTATCAAGCC CACTGCTTCA GACTGATGAG GGACATGGTA AGACCAGTGG 
181651 ATCTCAGGAT AAGCATGGGT CCGTTGCCAC AAGTGTTTGG CTGTGAAGAG 
181701 ATTTCTTTGG TCATGGCAAC ACTGTGTGGA ATACCATAAT GGTAGATAAG 
181751 GCATGCTGTA AGTTGATGAA TGGTAGTTTT GGCAGAAGCA TTTTGTGCAG 
181801 GGAAGGCAAA TCCATATCCA GAGTGTCTTT TTTCCAGTAA GGACAAAACA 
181851 CTACCCCATC CATGATGGGA AGTGGTCCAG TGTAATTAAT CTGCTACCAG 
181901 GTAGCTGCTA ATCACCCCAG GGAATAATAC CTCATGAGGG GCTCAGTGTT 
181951 GGTCTCGGCT GCTGGCAGAT TGGACATTCA GCAGTGCCTA TAGCCAGGAT 
182001 GGCTTGGTGA GTGAAAGTCC ATGTTGCTGA GCCCAAGCAT AACCTCCATT 
182051 CCTACCACTA TGGTTACTTT GTTCATGAGC CCACTGGGCA ATAACAGAGG 
182101 TGGTTGGGGA AAGAGGCTGA CTGGTATTCA CAAAACCACT TCCATTTGAT 
182151 GATGGAATGC TGCTGTGCGT TCCCAACTTT ATAGCTTGTT GGGTAAGATG 
182201 ACACTAAGTT TATGATAGGC AGCTCAGGTT ACATGATGAG TTGGTGGCCC 
182251 ATGATCAAGT GTTCAGTTTT TACTGAGGCC GAGGAGCAGG CCATGAGCTG 
182301 TCCCTCAAAG • GGAGAGTAAT. TATCTGTGGA TAATGGCAGG GCCTTGCTCC 
182351 AAAGTTCTAA GGGCGTCCCT TGAGATCACC TATGGGGGCC TGCCAAAGCC 
182401 TCCAAACAGC ATCCCTATCT GCTACTGACA CCTCAGCTAC TATTGTATCT 
182451 GCTCGATCAT ATAACCCAAG TGACAGAGCA GCCTGGGCCT GTTGTACATC 
182501 CTTCTCTTGT TCTGGGCTCA CTCAAAACCA GCAACT^TCT AAGTCACTTG 
182551 GTATGAGCCA GAGTAACACA GCCACATAAG TAATGTATTG CCTCCAAAAT 
182601 CCAAGTAGGC TACCAGGCAT TATGCCTCCT TCTTGGCTCT AGGTGTGGCC 
182651 AGATACAGCA^ATTTATCCTT CACCTTAGAA GGACATGTCC CATACTACTG 
182701 GATCCCTAGA AACATCACTG AAGGAGAAGG CCCTTGAATT TTTGATTTAT 
182751 TTCTCACCCT CTGCCATGCA AATGTCTTAT CAATAAGCCC AAAGTAATTA 
182801 CTACTTCATG CTCACTAGGT CCATTCAGCC TAATATCATC AGTGTAATGG 
182851 ACCAGTGTGA TATTTCATGA AAGGAAAAGA TGATCAAGAT CCCTGTGAAT 
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182901 TAAATTATGG CATAGGGCTT GAGAGTTGAC ATACCCCTTA AGAAGGACAC 
182951 TGAGGGTGTA TAGCTGCCCT TGACAGCTGA AAGCAAACTG CTTCTGGTGG 
183001 GCCTTAGAGA CAGAGGTAGA GAAAAAGCCA TTGGCCAGAT CAACAGCTGC 
183051 ATACCAGGTA CCAGAAAATG TGTTAATTTG CTCAAGCAAC AGAACCGCAT 
183101 CTGGTAAAAC AGGTACAATT AGAGTCAGCA CTTGGTTAAG CTTGTGATAA 
183151 GCCATTGTCG TTCTCTAAGA CCTATCTGTC TTTTGCACAG ACCAAATAAG 
183201 AGAGGTTACT GGGAATGTAG TGTGAATTAC CACCCCTGAA TCTTTCAGTT 
183251 TCCAACGGTG GCACTAAATC TATGTAATCC CTCCAGAGAT GCGGTATTGT 
1833 01 TTTTGGTTTA CTGTTTTTCT AGGTAGAGAA AGCTGTAATG GCTTCTGTTT 
183351 GACTTTTCCC ACCACTCCAT AGGTTAGGGA ACTAATGTGA GGATTCCACT 
183401 AACTGCTTGG CGTGTCCATT CCAATTACGC ATTTTAGAAT TGGGGAAATG 
183451 ACCACAGGAT GGTTTCAGGG AACCACTGGA CCCACTTGGA GTTGGGCCCA 
183501 AGCTGAAACT CCATTGATCA CCTGATCTCC ATAAGCCCCT AATTTACCTG 
183551 GAGGACCACA GTGATGTTTC TGATATCTTA AAATCAATGT CAGTTCAGAA 
183601 CCAGTGTCCA GTAGTTTCCT AAAGTTCTGA TTATTTCCTT TGTCTCAATG 
183651 CACAATTATT CTAGTAAAAG GCCATAGGTC CCTTCGGAGA AGGATGGGAG 
1837 01 AATCATTAAC CATATAAATG TTCAGTAGTG TACAGGGGTC TTTGCTAGAG 
183751 GGGACCCAGC CTCCTCTTCA TTTAAAGGGT TTTGGGTCTG TAAACTGCCC 
183801 CAAGACTGGA AATTTAGTGA AGGGCCATGA TTCTCTGTTT TAGTGATTTG 
183851 AGTTACACTT TTGTTCATTT GACCTGAAAG TTTTCTGCTT ATACAGATTA 
183901 AGTAAAAATT TAGTAGGTTT ACTATCTACT TTACTTCTAG AAACACTGTG 
183951 AATAACTAGC TAACACCATA GGTTTACACA AGTCAGACTA TTCTGATTGT 
184001 TGCTTTGCCT CTGTTACCCA TAAGAGTAAC TTCACCCATC TTGCCTTTGA 
184051 TGATTGAGGG CCACCACTTA TTCCTGCAGT CCTGAGATCT AATTATTTCC 
184101 ATAGCATCTA TGTTTTTCAG TCATGTGACT GTGGCTTCCA CTGTGAGGTC 
184151 CAACCTCCAG AGAAGGACCA TCATGGAGCC CTTCAGGAAT GCTGGGGCTC 
184201 CCCTCCCAAA TTTATTCTCA AAGTATTGAT GAAAAGTATG TCTTCTGGAC 
184251 CCTCCCAGTG TGGGTGAGTA GGTTTTAAGT GACAGATCCA CTATAGCATT 
184301 CCAATCTTCC TAAGCCTTTT AATGTTTTTG AATTCCTTCC TCTACATTAA 
184351 ACCAAGGGAG ACCAGGCATC TCCAACTTGC TCATGATGGG ACACCTTTTG 
184401 ATTTATGTTT TAGCCAAACA ACCAAACTAT TAGCACTTTC CCTAACTCCC 
184451 CAAGCTACCA CATGAAATGC AGAATCTCTT CTTAGTGAGC CTATATTAAT 
184501 AAATTGGACC TAATCCAATT TTGTGTTCTT TTCACTATGA CCTCACATCC 
184551 TTAATACCTA TTCCCACACA TGTTCTTTGG ATTTCTGATT GTATAAATTA 
184601 GAAAACTCAA GTAGATCTTT TTTTAAAAAT TTTATTATTA TTATACTTTA 
184651 AGTTTTAGGG TACATGTGCA CAATGTGCAA AGTAGTTCTT TTGGAGTGTA 
184701 GTGCACCTCC TCATGGGCCA CCCTTTGTGC CTCACCTTTA GGAGCCTGCT 
184751 GGGACTTGAG TCTAGTTATA TGTCTAGAAG CAAAGAGGAA TATTGTTGGT 
184801 GGGTTCTGAG AAGAATCAGA ATTGTCTTTG TAGGCAACTG TCTCACAGGA 
184851 GCTCATTACC ATTTCTTTAG GCAATGCAGG GTTTATCAGG CAGATGTATT 
184901 AATATAAAGT ATGGTACCGT TGTGGGAAGA GATACAGCTG CAGCCACTGG 
184951 GGGTGAGGGT GGGAGGTCAC TTTTTCTGGC AAACAAGACT CATAAGAATT 
185001 TAGGGGCTTA ATGTCACCAG CCTCATCAGG GCCTTTCTAC ATGTGTTCCC 
185051 ATCCAACTTA CAGAATCCCA TTTTTTCCCC CAATCAATAC TGCCACTTTA 
185101 ACAGTAGACA CCCTGCAAGG 1 CTGACTTTAG CTTTCATTGT AATTCAGACA 
185151 ATCATATGAT CAGTGCTTGT GTTTGATTTT CAGCAATTTC AGCCCTGTGG 
185201 ATGCAGGAAG AAAGATTCTG ACCAGGCACA CTTAAAGAAG CCCTTAGGCT 
185251 ATTTATGCTT ATTTGAATCC CTGAGCTAAT TCTTTTTTTT CATAATTTTG 
185301 TCCAGTGACA TTAGAAGAAA CCAACTGAAG TCATTATGTT TCTTTGTTTT 
185351 CCACAAATGT TCGAAAGTGT CATATACAAA GTCACCAGGT TCCTTTCTTC 
185401 TTGTATGTGT TTAATTCAAA GTATCAAATA TAAATATTCT GGGTATCTAT 
185451 AAAAATCCTA TGCCATGGAT TCTCAGTGCT CTCTGTACTA TTAGAAGTAG 
185501 AGCCCTGAGC ATTTTTAGAT CTAATCAGAT TACAGAACGA GTTCCAGAAG 
185551 CTCCAAAATC AGTTAATAAA ACTCATCCTT AAAGTCTTCT TTCTCTAGAA 
185601 CCACTCTGGT AGCAAGATCT GTATCAGTCA GGTTTCTCCA GAGACAGAAA 
185651 CAATAGGATG GATGGAAGGA TAGATGATAG ATAGATAGAT GGATTATTTA 
185701 TAGATAGATA GATAGATAGA TAGATAGATA GATAGATAAT GTGTGAAAAG 
185751 AGATTTATAA GGGAAATTTG CTCACACGAT TATGGAGAGT GGGAAGGCCA 
185801 TGACAAGCTG TCTGTGAGCT GGAGACCCTG GGATACTGGT AGCATGGCTC 
185851 AGTCCAAGTT CAAAAGCCCC AGAATCAGAG AAGCTGATGG TGTAATTCTC 
185901 AGTCCAAGGC CAAAGGACTG ACAACTGGTG TGCTGCTGTT GCAATTCTTG 
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185951 GAGAGCCTGA AGTTCTGATA TTTAAGAGTA GGATAATAAG ACTGTCCCCA 
186001 TTTGAGGAGA GAGAGAAGAG AGAGAAAATT GCCTTTTCTC TGCCTTGTTG 
186051 TTCTACCTGT GTCCTCGGCT GATTGGATGG TGCCTGTCCA CGTTAATGTT 
186101 GGATCTTCCC TACTCAGTCA AACTCATATG CCATTCTCCT CCGGAAACAT 
186151 CCTTACAGAC ACACCTAGAA ATGATGCTTT ACCATTTCTC TAGATATTTC 
186201 TTAATTCAGT TAAGTTGACA CCTAAAATTT ACCATTACAC TTCTCTACTT 
186251 CATAGCTCTG TGATCTTGGA TAAGAGACTC AACCTGTCTG AGTATCTGTT 
186301 TCCTTGTCTT TAAAATGGGG CTAATAATAT CTACAAATTT GTCATAAGCA 
186351 TTGGAAACAA TGTACATAAA ATACTTGGTA CATAGTAGAC CTTCACAGTA 
186401 ATTATTATTC ATTTTTATTA TGAAATAAAT AGGCATAAAA AGAATGTAAA 
186451 TTAATCTGTC TATTAATATG CAAATTTGCT TTTATTCATG TGCGACTCTT 
186501 TCTTTGCTGC TATTTTTTAT TATTGCTGGT AATTAATGCC TCAAATGCTA 
186551 GCAGCTTTTA TGCCTTTAAC TATCAGAAGC TTAGCAGCTC TACCTTGTTT 
186601 ATATTCAGGT TAACTTATTG CTGCCTATGT GAAATTGTGT GAAGGGATTT 
186651 CAGAAGTCAT GGATAATTTT CATCAATTGC TGAATTCTAT CCTGGAGGTT 
186701 GAAATATTTT CACAAAGGCA ATGACCTCCT CTTCCTACCC TGCCCCATTA 
186751 AAGATAAGGC TTAGATACTA CATCTCTCTT GAAAATCTTT CTGCTCTTTC 
186801 CTTCTCCACA TTTGTCACTT TAACATCTCC CATGCATGCC CTCTAGACAA 
186851 AATTCATCTT TCCTCTCTAT GTAGTTTGAA CACATTGCTT ACATTTACCA 
186901 TAAGTTGTGA AAACTAAAAT ATTGTTTTTT GTGATACATG GCACCTAGGA 
186951 GTTTGAGGCT GCAGTGAGCT ATGATCATGG CATTGAACTC CAGCCTGGGT 
187001 GACAGAGTGA CACCTAGTCT CAAAAAAAAT TACAATGTGA AAAAGTATGA 
187051 AGTTGGTGAA GATGATGAAT TTCAAATTAG GCTGCAAGGT CCAACTTGAC 
187101 AAAACTCTGG GTTCATTGGG CAAAATCTAT TTCAGAATGT TTTTCTATGG 
187151 CTGATTTTGA TATCTAGGGT TGGAAGAGAA TGGAGAAATC ATGAGGACAT 
187201 TT AGAATTAT GAGGTTAGTG CCTGCTAAGA ATGAAAGGGG GAAATTTATG 
187251 CAGAGAAGGG AGATGGTCAT AAGACAATAC AAACACCACC AGTGTCCAAC 
187301 TTTGATGAGT ACTTTTCTTA TTAAGCACAT ATTTACCGAC TTTATTTTAT 
187351 AACATCATCA AATAATATGA TTTTCTATAC TCTCACAAAT CACTTATTGG 
187401 GGAACATTTT ATGCACTTGA AAATGTGATT TTAGATAGAA GAATCCCAAT 
187451 GCGCTTAAGA CTCAAATGCA AGACAGCATG TAATCATGCC AATGAATTCA 
187501 TTTTGTAAAT AGATGCTTAT AGCAAATTAT AAGTAATTAT AAAGTAATTA 
187551 TGTAAATTAT TAAGTGACAA TGAATTAGTA TGGCACATAT TTATAAGGAG 
187601 GGAGTTATTG TTGAGCATTT TGCAATGCCA ATGTTAATTA ACAAAGGCTG 
187651 TATTAGAGTT TGGGGGGAAA GCTTTACTAA CCCAAGCCTG A.GGATATCTG 
187701 AAGTGAATGA GAAATGTTAA TCTGCAAGCC ATTTGTGTTA GATATATACT 
187751 ATTTTCCTGA TGTATTAACA TTTTCTAAAA GATAAATTAG ATTAGTTCAG 
187801 CATAGCTTGT TCTTTGTCAG CO?CACGCTGG TGCCTTATGA TAATTGTTTT 
187851 TCTTGGCAGA ATCTGGCAGG GATTATCAGC TATCATGTTA AACCATCCCC 
187901 CAACCCCTCA AGTGACCTTT TCCCAACTAT GAAAACCAGA ACAACAATTA 
187951 CTCATTTTTG GGCCATTGAC AGTTCTCATT ATTTCATCAT TTCTCAAGGT 
188001 TCATTAATAT TTGTGTCATG ATCAGTTTTG CATGTTATTT TAATTTCCTG 
188051 AGATAAAATT ATTGTTGGCT TCCACACATA TTCTTTTAAA GCAACAAGCT 
188101 GCTTTCATCC CAGTTAAACT TGCCAAAACT TTCCCTACTA ACCCAAAGTC 
188151 CTTAATCTGA CACCAAACTC AGTGCACCAA GTTTGGTATC AGATTAAGAT 
188201 AATTAAATCA TTATCTTTAA GAGTGATTCC TAAATCTCAA GGGGTTCCTT 
188251 CCAAAGGAGA TGCATCAGAA TCTACTGCAC CATTGAGCCT TGTGTGATAA 
188301 AGCCCTCCAC AAGATTTAAG GAATCAGAAG TAAAGGCTTG GGAGGATGGG 
188351 CATACAATTA GGCCCTGCAT AAGGAAAATG CATATAACCA GGAAACCTGT 
188401 TTTAATTAAG CAGAAGTCTA CATGCTAACA AAGAGAAGAA ATTAAGGAAC 
188451 TCTAGCCAAA ACCAGATTTA TCATAATTTT CATACATCCA TTTTGTTTTC 
188501 TCCCTTCTGT ATCTGTGATT ATGCAAAGCT TGCCTGATAA ATGATAATTG 
188551 GGATATAATC AAGAGATAAA TAGGGGTTAG GAGTTAACCA TGTAAGAAGT 
188601 TAGAGAAGGC TGTTCAGGCA GGTAAAATTG CATATATGAA TAATGTGAAA 
188651 CCTAGTGTGG GGATGTGATT TTGCCTTGGA GACATTCAAA GGTTGTATTA 
188701 GTCTGCTTAG GCTGCTATAA CAAACTGTAA AAGACTTGAT GGCTTAAACA 
188751 ACAGGAATTA ATTTTTTTCA CAGTTCTGTG GGCTGGAAGG CTGAGATCAG 
188801 TAAGTCAGCG TGATCAGGTC CTGGTAAGGA CACTATTCCT GGCTTGCAGA 
188851 TGGCTGTCTT CTTATAATGT CTCTTCACAT AAGGGCACTA ACCCCATCAT 
188901 GAGGACCCCA CCCTCCCGAC TTTAACTAAT CCTAATTACC TCCCAATGGC 
188951 CCCATCTCCA AATATCATCC CATTAAGGGT TAAGGCTTCA ATGTATGAAT 
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189001 TGTGGGAGGG GACAAACATT CACTCCATAA CAAAGGTCAA GGAACCTGGA 
189051 GAGCAAGAGA AGATGAATAT GAGATGAAGA TGAAGAGCTG ACTAAGGAGA 
189101 TATTTTTAGA AGCATGATAA TGGCAGGTCA TGGGGCTGGA AAGCGAGGAT 
189151 ACTTCCAGTA AGTCAGGAGA GTAGTCAAGG AAGTGACGAT GTCTTTGGGA 
189201 CACCCCAACT GTGGTGATCT TACAATCAAC AAAATAAGCC TCAGCATTCA 
189251 CACTGTAGTT AAGCTCATTC AAGCAAAGCT ATCTCCAGTA GGGAATTTCC 
189301 CCCATGAAGA GCATGAACAT TTTGATTTTA CCTGTTGTCA AGCTGATCCT 
189351 CATTATTTGC TCATCTTAAT GGTAAAAAAC ACAGCCTGGG TGGAGATTTA 
189401 AGCTAATGAG ACATGTGATA TATGAACAAG CATGCACAAC TACTGTACAT 
189451 GTCCACCCAG AGGACCACCC AGAACACGCA TACTAGCAAC ACCTCTTACC 
189501 ATGCCCTTAT GAATAACCAT GTAAGACTCC CATAAAGGAA GTCTCCCTAG 
189551 TCTCATTCTT TGCCGTCTCA ACCCTACTAG CACCCCGCCC TGAATCCTCT 
189601 CACTCTCGGG GGTACTGTCT ATTCTGCACC TAACTTTCAA AATATTGTTT 
189651 CTCCTTTGCA ATATGTAGTG ACATATTTTT ATTTAGCAGT TTTGAAACAC 
189701 TCTTTTTGTA GAATCCGCAT GTGGATATTT GGACCGCTTT GAGGCCTTCG 
189751 TTGGAAACGG GAATATCTTC ACATAAACAG TAGACAGAAG CTTTCTCAGA 
189801 AACTTCTTCA CTCTGTGATG CATCTCCTTT GCTCTATGCG TCTCGTTTAA 
189851 ATTATTTTAA ACTGTGAAGA CAAGAACCGA GGTCTCACAA CAGCTGTCAA 
189901 CACTCAGTGG TGAAGAGCTG TGTGTTAAAT GCCACAGGAC TTAGATCATG 
189951 TGCAAAAGAT TTTATAAGCA TTAAACTCCC AAGCAGAGTA CTCTTATTTC 
190001 AGAAAAATCA CCCTGGGGGT AGCATAAAGA CTGGGTAAGA AAGAAAGAAT 
190051 CTGTAGGTGG GAGGACACCT AGGGCGGGGA TGTTCATTAT AGGACACTTG 
190101 TAGTAGTCCA GAAGAAAGAT GGCAGGAGCC TGAACCATGA CAGTGAGTGA 
190151 GATGAGGGAG CTAAGGGAAG TTCTGGGTAT TCAAAGGAAA TAGTTGCAGT 
190201 TCTAAGCCTA AAAAACAGAG GATAGGTCAC AGAAATAAAT AATAGGGGCA 
190251 TATTGTTTCC CTAAATCTCT TCTGTGAGTT GTGCAATATT TTAATTTTAC 
190301 CACCCAAATT TCTATGTCCA TGCATAATGG AATTCACTGA GGCAGTAGTG 
190351 GCCGTCAATC ATTGCTCCAG TGCAGAAGTT CAGCTCGTGC TGGGGTTATT 
190401 TTCATGACGT TTTTGTTCAT TTTCAGTTGC TATAACAGAA TACCAGAGAC 
190451 TGGGTAATTT AGAAAGAAAA GAGGCTTATT TGGCTCACAT TTCAGCAGGC 
190501 TGGGAAGTCC AAGACTGAGG GGCTGCACCT GGTGAGGGCC TTCTTGCTGT 
190551 GTCATGACAT AGTGGGAGGA CATCACATGG CAAGAGGCAA GGAGTGTATC 
190601 AGCTGAAGTC TCTCTTTCAC TTCTTATAAA GCCACCAGTT CCACTGTGGG 
190651 GGCCCAACCC TGATGACCTT ATCTAATTCT AATTACCTCC GAAAGGCCCT 
190701 ACCTCCAAAT GCCATCAACA TATGCATTTG GAAATTAAGT TTTCACCAAG 
190751 TAAAATTTGG GGGATACATT CAAACCATAG CTCATGGTCA TGACAATTTC 
190801 GCGGATAGTT GTGCCATGTG TCTGGACTGG GACCATTGAA TGGTGGTTTT 
190851 CCATCACTGA AGAAATAATT AAAGTGACCC AAACAAAGTG AGTGAGTGAG 
190901 CGTAAAAGTG TCCAGGGCTA GGAACTATAA ACGGTAGGTC ACCTTATTCT 
190951 CAGTTGGGCT GAATTCTCAT GTTGGGCTCT ACTTTAGTAG AAGGCAGTTG 
191001 GGAAGTGCCT TCTTACCAAC ATCAGAAAGA AGTCAGATAA AATCCTGACT 
191051 TCTTTCTGAC TAATCTGGGA CTCCTATGAG AAAAAGAATT GGTACCTTTT 
191101 AGTTAAAAGA AAGAGATCTC TTCCTGAAGA GACCAGCAAT AGGAATATTA 
191151 CACTGCTTCA TCTCATATCT AACCCCAAAA GAGCCAAGAA AATGCACACA 
191201 ATATGGAGAG AAGCAAATGG ATCAAAACTG AGATTAGAAC ATAATCCCTC 
191251 AGCAGCCAGA CAGTCCTTAG GAAAGAGAGG CAGTGCTGTG GGATAGGTCC 
191301 AAAACAGGAA GTCCAAAAAC AGAAAAAAAC TACTTTTAAA ATTTAGATTT 
191351 ATTCGAGAAA GAATTTAAAA TATGTCAAGC ACATACACTA ATTTAGTATG 
191401 TTTATGATTA TCCTTATAGT TTATAATACT ATTTTTATAA TTTACTATAT 
191451 TTCCCAATGC TGATATTCTG .CTTTCTAGCA TATGCTATCA TTTTACATAT 
191501 ACATATGCAT ACACACGCAC ACACACACAA ACACACACAC CCCAGGATGT 
191551 GTGTGCTAGG ATAAAAGGCA GCAAAACCAA AAGATGCACT TGCTTACAGG 
191601 CACTGCAAGC AGCATAATCA AGGATGTCCA CGAAGGAGAG AAGCCTAGAG 
191651 CTCAGACCTG GAGAGCTGTT TCAGTGCTAA GCACGAGGAG TTGGTAAATA 
191701 AGGAGCATGC CTGGAAATAG AGTGCAAGAT GAGCCAAAGG GAATGATTTA 
191751 AGATGGAGTC TATGAATGAA GTGAAAGTGT CAGGAGCCCT TTCTTACTGA 
191801 TAATTGCTTT GTGGAGATAG GTTCACCTGC AGCCTAAAGT CACGGAGCAC 
191851 GTTCAGAAAA TAATAGCAAT TTGCCATGTT TGTTGGGAAT AAATAGAAAT 
191901 CTAGGGAAGT GATTCTCAAA GTTTGTTCTC TGAACCATCA GCATCAGCAT 
191951 TATCTGGGAA CCTGTTAGAA ATGCAAATTA TCGGACCTCT CTCCATATCT 
192001 GCTGAATCAG' ACACTCTGGG CTGGGATCTA GAAATCTGTT TTAATAATGT 
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192051 TTCCTGATCT AAGGCAGTGT GTGTACATGT TCCTGTGTGT GCACAAGTGC 
192101 ATGAGGGAGG TATTGAAGAG GACAAAGGCA GGGACAGGTT GGGAAGAGCT 
192151 AAGTATTGTT TTAAGCAAAG TCTTTCTCCT TTTGGTATAG GTGACCTCTT 
192201 CTTCTCCTGG TCTGACTCTG TAAGGGGTGA AAGAGGATAT AGTAGAATCC 
192251 CAAGCAAAAA AACCTGAAGG TAATTAAGGG GGAAGAAGGA AAGCCCTTCC 
192301 TAAAAAAATA ACAAAAAGGG ACCCTACTTT GTTATTTTGC CCCCACATTC 
192351 TTTATGCAAA CCTTGATTAA TTAAATTTTT AAAACTTGCT AGGTTGTTAT 
192401 TGATGGAACT CTGCTCAAAA ACGCCGAATA AGAAAAAAAA CCTGGAAAAA 
192451 AAGGCCACAG AATTCTAAGG GGAAGAACAC CTTTCATGTC TTCTCAGCCA 
192501 AGGAAACCTA ATACCTCTGA ACTCTAGCCC CTTAGCCCTT CTGAAGCTCT 
192551 TTAAATAGCT CATGGGGGTT TAGGAAGATT TTTCAAAGTA CTATCTGAAT 
192601 GATATTAACT TGGAATATTT TCTTTTAGCC CCCAAAGGAT TCATTTTTAA 
192651 AAATGCTGCT TGATCATTAA CACAACCAGT GTCCTCATTA ATAATCTGAG 
192701 CTGCCTACAA CCATGTCCTG GTTACCTCTG TGCTCTGCTA TTTACAAGCT 
192751 GGTCACAAGG AGCTCTTGCA AGTCATTGAA TAATAATCAA TTTGCTTTGT 
192801 TTTCCTTCTA GATTCATTAA ATTGTCTTCC AGGATGCCCA TTAAACATGC 
192851 ATTTCTCATC AACTAAGTGG TTCAAAATGA GAGCCTTTGA ACACTTCTAA 
192901 TTAGCTCTGT AAAGTTTAGA ACAAAGTTAA CAGAGACCCC AAAAGACTAT 
192951 TGTTTGGAGT TTCAGCCCAC GGTTTGCTAG TTGCCAAGTA ATGCAAAATC 
193001 AAGAGTGCAG TTTTTACCCC CTGAAGATCT CTTTGAAAAA CCAAGATAAA 
193051 AGGAAGTACA TTATTTACAG TAGTTGTCAC ATTTGTGGCT AATAATTGTT 
193101 ATTTAGGTTA ACTCACATGC ATACGAAACG TGTTGCCTCA CTTGGAGAGG 
193151 AATGGCTCGG TTTTCCAAGC CCTGTAGGCC TTTTCAGAGA CCTGAGCCAA 
193201 GGGAGGTCTA ACATCTAAGA AAGATTGTTA TAGTACTCCA GAGTTTAAAA 
193251 CTCTTTTTTC TTGGATGAAA GCACTGTAAA ATTAACTCTT ACCTAGCTAC 
193301 AAAGATTGGA GGGCATTAGC AAGACTACCT GCGAGATATT TGAGAGGCAT 
193351 ATGGAGAAAA TGGATGATCT CTCAGAAATA TCCTCTTAGA ATAGCCGTTA 
193401 TCAACCTTGC CTATGTATCT GAAACCCATG GAAAGCTTTG AACCAATGTT 
193451 GATATCTGGG ACCCATCTCA GACCAATTAC TTAAAAATCT CTGCTGGTGG 
193501 TATAGCCAAG ACAGAGTCAT ATTTCAAAAG CTGCTCAGAT GATTTTAATG 
193551 AGAAGCCGAA GTTGAGAGCT ATTGTCTTAG AATCCATCAG TTATCACTTT 
193601 CTGGTCCCTG AAGAAGAACT ATGGGACCTA TATGGGTTAA AGGGATGCTC 
193651 CTCCCCACTT CCTACTTATA AAGGAATAGC TGCTCTTCTA TTTTTGTCAG 
193701 AGCAGTAGCA TAGATTATCC AAAAGCCACT TGTGTTTTTT TTTATTGTTT 
193751 TAAAAAAATC TTAAAAGAAT CATCTAGTTT AACTTTGTAT TAATAAATAT 
193801 TAAAATTTGT TTGCACTCT^A ACTAATCCAA TAAGCATACA TTGACTAAGC 
193851 ATAGAGATGG ATTTTGGGGG ATATGAAGGC TACTGACATT GAAATAGTCC 
193901 CACAAAACAC AGAGCAGATA CACAGTTAAG AGTGAACTTT AATTAAGCTG 
193951 TAAATATTTT GAGCTCTATA GATTTTTCTT AGTATCTTCA GGTTCTAGAG 
194001 AGTTAATGTG GAAGAAGCAC ACCCATACAG GTAAATTAGA GAGAGACCAC 
194051 AGCATGGCAC ATGAAGCTTC CAAGTGACTT TCAGTTGTGG CCTCAGATAA 
194101 TATTTATTGT GAAAAACCCA TTGAAATCCT TAAACCTTGG TAAACTTAGA 
194151 GTTTGGTAAA TTGTCTTAGA AAGTCAAATC ATCCATATTG GTTGGTTCTT 
194201 TGTATAACAT AACTGTAAAA AATTATTTAG TGGCTGGGTG CAGTGGCTCA 
194251 TGCCTGTAAT CCCAGCACTT TGGGAGGCCG AGATGGGTGA ATCATGAGGT 
194301 CAGGAGTTCA AAACTAGCCT GGCCAACATG GTGAAACTCC GTCTCTACTA 
194351 AATATACAAA AAGTAGCCAG GCATGGTGGT GGGTGCCTGT AATCCCAGCT 
194401 ACTCGGGAGG CTGAGGCAGG AGAATCACTG GAACCTGGGA GGCGGAGGTT 
194451 GCAGTGAGCC GAGATTGCAC CACTGTACTC CAGCCCGGGT GACAGAATGA 
194501 GACTCCTTCT CAAAAACAAA CAGAAAAATT ATTTAGTTTA GGAGTTGCAA 
194551 ACTGTCAGCT ATATTTTGTT TGGCCAACTT TGTGTTTTAA AAGCATAGGT 
194601 CAGTGGAATT CTTTGTGATT CCACTGTGAT ATAAACAGAA GGCAAGGAAA 
194651 TACTGGGAAG AAGAGGGTAG GGTCCCTGGT GAGGGCTCCA CCCTCAAGCC 
194701 TGGGCCCATC ACCCTAAATG AGAACTTCAC ATTCCTGTTT TCCGGCCTGA 
194751 ATGTTGCTTT TTGGCTCACA CTGCCCCCCT ATCCTGTACC TATAAAACCC 
194801 CTAAACTCCA CTGGCAGAGT AGCAGAGCGG CATGGCAGAG AAGGAGAGAA 
194851 GAGAAGAAGC ATCTGAATGC CAAAGAGGCA GCTGGATGGT TGGAAAGGAG 
194901 TTTGGCTGGT GATGGCCAAA GTCCAGGGGG AAGATTATCT TTCCACTCCA 
194951 TCCCATTTCC AGCTCCCCAT CCCACTGAAA GCCACCTCCA TCACTCAATA 
195001 AAACCTCTGC ATTCACCAAC CTTCAAGTCC ATGCGACGTG ATTCTTCCTT 
195051 GATGCTAGAT AAGGACCCGG GTACCAAGAG GGCAGGATGT GGTGTAAAAG 
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195101 GCTGTTATCC TGACTCTTCA CTGAGTTGGT TAACACTTAG CCATCTGTGG 
195151 ATGGCAGTTG CTAAAAGAGC ATTAATTGTT ACACACCCCT AGATGCTACC 
195201 ATGGGGCTGG AGCCCAAAAG CACTTCCCCA GCCCCAGCAC CCACTCACCT 
195251 GCCAAGTTCC CCCTCCCACA AGGGGTTTGA GTGTGGTGGC CGAGTAAGCA 
195301 AGCCATACCC CCTCTCACAG GTTCCATGAA AGGGTCCAAA GAACTCTCCC 
195351 ATCTCAAQGG TACTGTGAAA CAAAAGCCAC AGGAGTTGAT CTGGTATTAT 
195401 TCTGACTTTT GTTTAAAGTG GTCTTATGGG AACTAGGTCA TCCTAACATA 
195451 TTTTTCTCTT AGCAGGGCAA ACCAAGCAAT ATCTTGTACT ATGTTTCTCC 
195501 TCCAGATCTA GCAGATGACT CTCTTAAGAT CAGGACAAAT GTATCACTAA 
195551 TAAATATTTC AGGGCATGTT TTATTGTCTT ATACATTTAA ACCTTACTTG 
195601 TTCCCTCGTG TAGGTTCTAA GTCCTATATA GAGAAGGACC ACCTATACTT 
195651 ATTTTGTACT TCCCTCATTA CATGAAGTGT AATATTTAGA CACATCATGA 
195701 TTATTAAACA CATGCAGGGT GATAAGTAGT GTTATTCAAA TACATATAGA 
195751 GTTTTGTGTG TGTGTGAAAC ACATATGGTA GATATAAATG ACATATACAA 
195801 GTGTGTGTAT ATATATGCAT GTGTGTGTGT ATATATATGC ATGTGTGTGT 
195851 GTGTACATAT ATATATATAT ATATATATAT ATATATATAT ATATATATAT 
195901 ATATATTTGG AATGCCTATT TGTAGAGAGA GGGGGATTTG AGAGGAAGAA 
195951 CTGAAGAGCA TTCTTAGCAT TTTATGAATA TGATCATTGC CCTGCCAACA 
196001 GAGCATTTCA CCAAGTACAG AAACAGAACC ATTTAAACAA ATAACATAGA 
196051 TAGAAGAGTA TGGATGTTCT AACTGAATCA GATGCCCTAA CTTTGCAAAT 
196101 TCTGCCAGAT GGGAGGTGAG TGCCCTGAAC TAACATCTGG TAATGAATAT 
196151 TAAACAAAGA ATAAAAATAA AAGTTAGTAA AGGCATTTTT TTTCTTCTTG 
196201 CTTCAGGCCT TGGTTTAAAC TAATTTCATC GGATCCTCTT TTCCTTCTTA 
196251 TGATGTAGAA TTTTATTTTC CAGGGAATTG AAGACTCTCT AGAAGAGCAT 
196301 ATTAAACAAT TATTCTTATC ACCGAAGACT GCTTTTTGAC AGGTACACTC 
196351 AGAGACAAAA GAGAAACACC TTTCCCTGGA GGAAATGGGA AATAAGGAGA 
196401 AAGAATACAA AATGTAATTC TAATGTGGTA CAGTCCAGCC ACACTAGAAA 
196451 AAGGTGATGA ATAATTCAGG AGAAAATAAA GTACAAATTT GGAACCTTTT 
196501 GTGAGCATAC TAGCAATCTC AGAAAAAGAG ATAAGTAGAC TTGGAGCAGA 
196551 ATTCAGCTAG ATCATTGGAT GTTGGCTTAG AATTCACCTT TGTTTTGTCT 
196601 GAGTAGTTGG TCAGGCTCCA GTTGAAGCTG AGCGTGGCTG AACAGTCAGA 
196651 GAGGGAAAAT GAGGCCATGA AATCAGATAT AGTGACGTAA AGCAGTGTAA 
196701 AGCACATCTC TTCCTTTGCA CAATAATAGC AAATCTGATT TTTCTCTCTG 
196751 TCTTAATCAT AGTGCTATTT ACCTTGACAC TTCTGGCATT CTTGGTAGGA 
196801 CCACATCCTA GACTCCCAGA ATTTTCTTAA CCTTGGAGAT AAATTTAAGG 
196851 TAGAGAAGAT GATAACCATA TAACCCCAGC ATCAGAGTTC AGCTCTGCTT 
196901 GTTTCAATAG TGCTTTGGCA TTCCAGTTTC TAGTCCCTCC TTGCCATACT 
196951 AGGCCTGTGA AACTGGCCTT GTTTTAGCTA CACACTGTAA TAGAAAACCA 
197001 TGCCTATGAG ACAGATTCTA CTAGTATTCA TGAAACTCCT TCTAGTTAGC 
197051 TTTATTCTAA CTCTTATTCT TTCATACTCT gTCCACTATT CCCTCTCTAG 
197101 CCTTGACAAA TCATGACAAT ATTTCTTTCT TCTCAAACTC AGTGCCCTGT 
197151 GAGAGTCTCA GACTACTTCC ACTCTTCTGA ATTCTTTTTT GTCAGTGATG 
197201 GTTTTGGACT TCCCTGTTGT TTCTTTATCT TCATTTAGAC AGACTGGAAT 
197251 TACTATGAGG TTGGCCAGTG ACTCAGTGAT TCAAAAAGTT GAAGCTGTAT 
197301 GTGATTTGTT GGATGCCTTC TTGAATTCCT GCCTCTATTG GCCAATTAGA 
197351 AAGGACAGTT CTCATGCATT GTCTAGGATG CCTTCAGAAA TCGCTGTCAC 
197401 TCAAATTCAT ATTGTAGCTA CCATAAAATA TGCAACTTGG ATAGAATTAT 
197451 TCTCCAAGAG CATTTCTTCA TGAAATCATG TATCAAAATG GCATGCTATT 
197501 TGTGCAAATT TAAGGCGACT TATGAATATT TTGAAACTCT TGTGACCCCC 
197551 TTCTGTATGG AATATGTTCT AAGTTTGAGA GGGGGGGTAT ATAGACTGTC 
197601 ATTTTATGTT AAATGGCAGT GTGAAAGAGA GAAAAAAACA GTTAAGTGTG 
197651 AGCCAACAAT AAACTTGAAG CCTCCGCCTG ACTCAATGGA CCCCCTCTTG 
197701 GCCAAAAGGA CCTCAGAGAA ATCGGAAAAA CCTGAATTCC CAGACTGGAC 
197751 ACGCCTCCTT ATACTCTCAG CCTTTTGGAG TTTAGGCACA GCTGACAAGC 
197801 ATTAAAGTTA AAACAGATAT CATAAGATTG ACACAGCAGA CTCTTTGTGG 
197851 CAATAAGATA TCAAATTATA AACCAGACCT AAGGCCATGC AAGGCAAGGG 
197901 TTAAATCACA CCCTACAAAC CATAACATCT CATGACGTGG ATTCTTAAAA 
197951 ATGATCCAGG TACAGCATGG CTCACTTTCC AAACTGATTC TAGTGTAGTA 
198001 TCACATGACA GATAGCAGAC CCTCAAGCAA ATAAAAACAT TTTACTCCAA 
198051 AATATATTTG GCATATTTTG AAATGGCCCT GCAAAGCTGT CTTTTGTGGA 
198101 GAAAATTTGC GTCTGCAGAG AATCTCCATT AATGCAGCTA GACCTTCCCT 
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198151 TTCTAGGCCT TTTCTGGCTC TAGAAGAGAA AATTAGTCTG *ATACCATTAG 
198201 GTCTCAAAAG AGACATTTAC TGCCTGTTCT CTTGGAGCTC TGTTACCTAA 
198251 GAGGCTTAAT CTACATAACA AGGGCCTTGG TCCCCACAAC CCCTTTATTT 
198301 TAACTCAAGC ATTCATTTCT ACTGACTTTA GGTCCTTAGA CAATAGCTTA 
198351 ACTCTCTCAA CCAATTGTTA AATAAAGAAT CCCTAAAACC CACATGTGAC 
198401 TTACAAGTTC TCACTTCAAG ATGTCCTACC TTTCAGGCCG AACCAACATG 
198451 TACCTTGTAC GTATTGATTT ATGTCCTTGC CTGTAACTCC CATCTCCCTA 
198501 AAATGTATAA AACCAAACTG TAACCCAACT GCCTTGGGCA CACTTTCTCA 
198551 GGATCTCTTG ACATTGTGTT TTCTGGGTCA TGCCACTCAT ATTGGCTCAG 
198601 AATAAACCTC TTTAAAATAT TTCACAGTTT GGATTTTCTG TTAACAAAAG 
198651 AGGAAAAATG GAAAATTAAA CTATAAAGAA CCAGTAAAAA TAAATGTAAT 
198701 TGGCTTTATA TCTTAGAGGA ATATATGTAT AAAAATGTAT TTTGTTGTTA 
198751 TGTTTTGTTG TTCCATATGT TCTCTTGCTA TTGTGATTTT ACAGACTCTC 
198801 TACTTTCTCA TTGGTTGTTC. TCTGGCAGCT CATTTATAAT GTGGATTTTG 
198851 ATGTTTGGTG AATATAACAA TGTGTCAATG AAAAGAGCCA AACTATAAAA 
198901 TATTTACAGA GATTTATTCC GAGCCAAATA TTAGTGACCA TGGCCCGTGA 
198951 CACAGCCTTC AGGAAGTCCT GAGAACATGT GGCAGCCAAG GTGGTACAGC 
199001 TTGGTTTTAT ACATTOTAGG GAGACATGAG ACTTCAATCA AACACACTTA 
199051 AAAAATGCAT TGGTTCAGTT CAGAAAGATA GGACAACTAG AAGTGGGGGG 
199101 CTGGGGTGGC TGCCCTGCGG CAGGGGCTTC CAGGGTATAG ATGGATTGAA 
199151 ATATTTTTTG GTTGAGAATT GATTGAGTTT ATCTAAAGAC CTGGGATCAA 
199201 AAAAAAGAAA ATGTCTGGGT TAAGAAAAAG GACTGTTCTT ATTTGTAGAG 
199251 GAAGCCTTAG CCTTGAGAGA GAATAGGTTG AAAAATGTTT CTTATCAGGC 
199301 TTAAAGTCTG TGTTGGTGTT AATGCTGGAC AGGTATAATG AGGCATCTCT 
199351 GACCCACACT TCCCATCATG GCCTGAACCA GTCTTTCAGG TTAAACTTTA 
199401 AGAGTGCCCC TGGCCCAATC AGGCAAGAGA AAGAAATATA GCGTATTCAG 
199451 TTAGGAAAAG AGGAAGTCAA ATTGTCCCTG TTTGCAGATG ACATGATTGT 
199501 ATATCTAGAA AACCCCATTG TCTCAGCCCA AAATCTCCTT AAGCTGATAA 
199551 GCAACTTCAG CAAAGTCTCA GGATACAAAA TCAATGTGCA AAAATCGCAA 
199601 GCATTCCTGT ACACCAAGAA CAGACAAACA GAGAGCCAAA TCATGACTGA 
199651 ACTCCCATTC ACAATTGCTA CAAAGAGAAT AAAATACCTA GGAATCTAAC 
199701 TTACAAGGGG TGTGAAGGAC CTCTTCAGGG AGAACTACAA ACCACTGCTC 
199751 AACAAAATAA AGGAGGACAC AAACAAATGG AAGAACATAC CATGCTCATG 
199801 AATAGAAAGA ATCAACATCA TGAAAATGGC CATACTGCCC AAGGTAATTT 
199851 ATAGATTCAA TGCCATCCCT ATCAAGCTAC GAAATACTTT CTTCACAGAA 
199901 TTGGAAAAAA CTACTCTAAA GTTCATATGG AACCAAAAAA GAGCCCGCAT 
199951 TGCCAAGACA ATCCTAAGCA AAAAGAACAA AGCTGGAGGC ATCAGGCTAC 
200001 CTGACTTCAA ACTATACTAC AAGTCTACAG TAACCAAAAC AGCATGGTAC 
200051 TGGTACCAAA ACAGAGATAT AGACCAATGG AACAGAATAG AGGCTTCAGA 
200101 AATAAAACCA . CACATCTACA GCCATCTGAT CTTTGACAAA CCTGACAAAA 
200151 CCAAGCAATG GGGAAAGGAT TCCCTATTTA ATAAATGGTG CTGGGAAAAC 
200201 TGGCTAGCCA TATGTAGAAA GATGAAACTG GATCCCTTCC TTACACCTTA 
200251 TATAAAAATT AATTCAAGGT GGATTAAAGA CTTAAATGTT AGACCTAAAC 
200301 CATAAAAACC CGAGAAGAAA ACCTAGGCAA TACCATTCAG GACATAGGCA 
200351 TCGGCAAGGA CTTCGTGACT AAAACACCAA AAGCAATGGC AACCGAAGCC 
200401 AAAATAGACA AATGGGAGAT AATGAAACTA AAGAGCTTCT GCACAGCAAA 
200451 AGAAACTACC ATCAGAGTGA ACAGGCAACC TACAGAATGG GAGAAAGTTT 
200501 TTGTAACCTA CCCATCTGAC AAAGGGCTAA TATCCAGAAT CTACAAATAA 
200551 CTTAAACAAA TATACAAGGA AAAAACAACC CCATCAAAAA GTGGGCAAAG 
200601 GATATGAACA GATACTTCTC AAAAGAAGAC ACTTATGCAG CCAACAGATA 
200651 CATGAAAAAA TGCTCATCAT CACTGGTCAT CAGAGAAATA AATGCAAATC 
200701 AAAACCACAA TGAGATACCA TCTCACATCA GTTAGAGTGG CCATCATTAA 
200751 AAAGTCAGGA AACAACAGGT GCTGGAGAGG ATGTGGAGAA ATAGGAACAC 
200801 TTTTACACTG TTGGTGGGAG TGTAAACTAG TTCAACCATT GTGGAAGACA 
200851 GTGTAGTGAT TCCTCAAGGA TCTAGAACTA GAAATACCAT TTGACCCAGC 
200901 AATCCCATTA CTAGGTATAT ACCCAGAGGA TTATAAATCA TACTACTATA 
2 00951 AAGACATATG CACAGGTATG TTTATTGCAG CACTATTCAC AATAGCAAAG 
201001 ACTTGGAACC AACCCAAATG TCCATCAATG ATAGACTGGA TTAAAACAAT 
201051 GTGGCACATA TACACCATGG AATACTATGC AGCCATAAAA AGGATGAGTT 
201101 CATGTCCTTT CCAGGGACAT GGATGAAGTT GGAAACCATC ATTCTGAGCA 
201151 AACCATTGCA AGGACAGAAA ACCAGACACT TCATGTTCTC ACTCATAGGT 
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201201 GGGAATTGAA CAATGAGAAC ACTTGGACAC AGGTTGGGGA ACATCACACA 
201251 GCAGGACCTG TCATGGGGTG GG AGGATGGG GGAGGGATAG CATTAGGAGA 
201301 TATACCTAAT GTAAATGACG AGTTAATGGG TGCAGCAAAT CAACTTGGCA 
201351 CATGTATACA TATGTAACAA ACATGCACAT TGTGCACATG TACCCTAGAA 
201401 CCTAAAGTAT AATAATATAT ATGTATAAAA ATATATATAT GTATAAAAAT 
201451 ATATATATAT ACATATATAT GTGTGTGTAT ATATATGTAT ATATATACTT 
201501 ATATATGTAT ATACACACAT ATATATGTAT GTGTGTGTGT ATATATATGT 
201551 GTGTATATAT ATATGTGTAT ATATATATAT GTGTGTATAT ACACATATAT 
201601 ATATATATAC ATATAAAAGA GTGCCCCTGG CCAAGGAGTA AGTCCATTCA 
201651 GATGGTTGGC TAGCCTTAGA ATTTTCTTTT TTGTTTATAG ATGTTATACA 
201701 TAACAATGAT ATCCTTGTCC TATCCACAAG AGCCTATTGA ATTCACAAAG 
201751 TAGCTAAAAT ATTGCATGTT TGCAACATAG TGAGATCTTA ATGTTTATTA 
201801 AGAAGAGTTA TTTTAATTAA AACCAAATTG AGAGGAAATA GATGTAGACT 
201851 TTTAAACAAT TCTGTAAGGA CTTATGACTT TGTTTTTTTT ATTATTATTC 
201901 AGGCATTTCT CCACTTCTAC TTTCTTTTTT TTTTTTTGTT TGAGACAGAG 
201951 TCTCGCTCTG TTGCCCAGGC TGGAGTGCAG TGGCCTGATC TCGGCTCACT 
202001 GCAAACTCCA CCTCCCGGGT TCACGCCGTT CTCCTGCCTC AGCCTCCCGA 
202051 GTAGCTGGGA CTACAGGCGC CCACCACCAC GCCCGGCTAA TTTTTTGTAT 
202101 TTTTAGTAGA GACGGGGTTT CACCGTTTTA GCCAGGATGG TCTCAATCTC 
202151 CTGACCTCGT GATCCGCCCG CCTTGGCCTC CCAAAGTGCT GGGATGACAG 
202201 GCGTGAGCCA CCGCGCCCGG CCTCTACTTT CATTTTTATA GGAAACGATT 
202251 AAGACCGTTT TTTGTTTCTG TTTTGGTTGA TAGATATGAT GGAAGATGTG 
202301 GGGATGAGAG AAGGAGACTG AATCGGAAGA GAGATTTTTC GTGGTATCTT 
202351 TAGTGTTGTT GGAAAAAACA GGATTAAAAA GAAGAATAGG TATGTGAGAA 
202401 AAGGAATTCT TTTTCTTTTC TCTTTCTTTC TTTCTTTCTT CTTTCTTTCT 
202451 TTCTTTATTT CTTCTTTCTT TCTTCTTTCT TTCTTCTTTC TTTCTTTCTT 
202501 CTTTCTTTCT TTCTTTCTTT CTTTCTTTCT TTCTCTCTCT CTCTCTCTCC 
202551 TTCCTTCCTT CCTTTCTTTC TTCTTTCTTT CTTTCTTTCT TTCTTTCTTT 
202601 CTTTCTTTCT TTCTTTCTTT CTTCTTTCTT TCTTTCTTCC TTCCTTTCTT 
202651 TCTTCTTTCT TTCTTTCTTT CTTTCTTTCT CTCTCTCTCT CTCTCTCTCT 
202701 CTCTCTCTCT CCTTCCTTCC TTCCTTCCTT CCTTCCTTCC TTCCTTCCTT 
202751 CCTTCCTTTC TTTCTTTCTT TCTTTCTTTC TTTCTCTTTC TTTCTTTCTT 
202801 TCTTTCTTTC TTTCTTTCTT TCTTTCTTTC TTTCTTTCTT TCTTTCTTTC 
202851 TTTCTTTCTC TCTTTCTTTC TTTCTTTCTT TTCTGAAACG AAGCCTCCCT 
202901 CTGTTTCCCA GGCTGGAGTG CAGCGGCACC ATCTCAGCTC ACTGCAACCT 
202951 CTGCGTCCCA GGTTCGAGCC ATTCTCCTGC CTCAGCCTCC CGAGTAGCTG 
203001 GGACCACAGT CGTGCACGCG CCACCACGCC CGGCTCATTT TTTATATTTG 
203051 TGGTAGAGAC GACTTCCCAA ACTGCTGGGA TTTCAGGTGT GAGCGACTGT 
203101 GCTCGGCCAT AAAAAGGAGT TCTTAAGATG TCTTTATCGA GATGGATAAG 
203151 AGGAAAACAA AAAGCCAATT TGGCCATACA GGCAGGTCTT ACTTGACACA 
203201 ATTCTGATAT GCGTGGATTT CAGTTACAAG CACATGGTAT TAAGTAACCC 
203251 TAGTCCCAAA ATATGGTTCA AATTTTGATT ACCACAGAAT ATTAACTGTG 
203301 AATAATTGCA TTATATACCA ACGTTGCCGA AAGCTCTTCG TGCACAAACC 
203351 ACTAAGTAAA TAATAGGCTG CTATCATGGT CAATGGCCAA TCGGGTCACT 
203401 TCTTTCAAGG TCTATGGTGA GTGGTTACTG CACGTCTATT ATTCCATTCA 
203451 TGCACAGACA ACAAAGCATA TAGTTGTGTT ATGTCCTTGT CTGCCCCTGA 
203501 TTAATCCACA TGCCATTTTA CAAAATTAGA CAATCAAAAC AGAAGTGACC 
203551 CACAAAGTTC AAAATGGAGT AAAGAAAGAG AATGTGCTAA TTCTGGAAAA 
203601 GAAATTAAAA TTAAACATAA ATGAAGTTAT AGAACAAAGA GCTGACCTTG 
203651 ACATTGCCAT CTTTCGAGAA TCTCTGCACA TGCAGCCAGA TACACTGAGT 
203701 GAAGAAAAAC TTATTTATTT TATTTTATTA GGGAAACTTA TTGACATAAA 
203751 GTGGTTGTGA TAAAGAGGAT GAGGATCACT TTGGAAGGCC GAGGCGGGCG 
203801 GATCACGAGG TCAGGAGATC GAGACCATCC TGGCTAACAA GGTGAAACCC 
203551 CGTCTCTACT AAAAAATACA AAAAATTAGC CGGGCGCGGT GGCGGGGGCC 
203901 TGTGGTCCCA ACTACTCGGG AGACTGAGGC AGGAGAATGG CGTGAACCCG 
203951 GGAGGCGGAG CTTGCAGTGA GCCGAGATCG AGCCACTGCA CTCCAGCCTG 
204001 GGCGACAGAG CGAGACTCCA TCTCAAAAAA AAAAAAAAAA GGATGAAGAT 
204051 GTCCTAGAGG AAGTGATGCC AGCAAAAACT TTCAAATTGA AGGAACTTGG 
204101 AGATATTTCA TAATACTGAA AGTGCAAAGG TTAAAAATGT TGGAAGCTAA 
204151 TACAAACTTT GGAAGGAATG TGACCATTTG CCAAGATGTG AAAAGAAGCT 
204201 TGTTCTATAT TGTAAGTTAC ACAATGAGAA GAAAAGGGAA GCACTGTTCA 
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204251 AAGAGCTTAT CAAGAGACAA GTTTTTTTTA CAAAGAAATA TGACGTTTTA' 
2043 01 ATTCTCACTG CCTCTAATGT TTTAAATTAC AGTGTACTAA AAAATAGT1T 
204351 TACTGTTTTT TCCATCACCC TATACATATA TAACTGATAG TAATAGGGTT 
204401 TTTAATATTT TGACAAAAGA TTTAAAGGTC ACAGAACAAT TTAATTTTCC 
204451 CTAGTGATTA TTAAAGATTA CTTTGCACAG TATCAGCTTG CACAGTCATT 
204501 TTTATGAATA CTGCCTGCTG TGCAAAGTGA AGACTGTCTG CATTCCCATG 
204551 AAAAGAGAAA GAGACATTGA TCAGATCTTG CCCTATACAC AAATGAGATG 
204601 TTAAGATTGG TCCACAGCAG ATTGATTGGC AGTCTTGAGT AACGTGGGAA 
204651 CCCATGCATA CAGGACATAT GCAAATTAGA GTCCTGGAAT TTGATTTTAC 
204701 TCTGAAACAA CAGAAATAAC ATTCTCTCTT AGACCAAATT CTTAATAGCT 
204751 TGGAATCATT GACACTTAGA TTGGAAAAGG ATATTATCTG GTTTAGTTTA 
204801 CAGGTAATGA CGTTGAGGCC TGTACAGGTA AATGAAGTGG CTCATCTGAG 
204851 AGCACATGCC TGGTTAGTGA CAGAGGGATT AAAAAGAATT GAGATCTCTT 
204901 GACTCCTGGT CTACCCCTTC TTCGTGGAAA GAAACCAAAG CACATTCTTT 
204951 ATGAGACCAC CATGGAAGAT GTTGCAAGGC CTTCGAAGTA GGGGATGATA 
205001 CCTGTTCATC GCATGGCTCC TGTAGACGTA ACTAACAGCA GTATTGAAAA 
205051 AATGCAAGCT CCCAGTGAAT CTCTGTTTCC TTTTAATGCT GCAGTTTGTT 
205101 TTCTGGCAGC CTGTTAGCTT GCCTCAGAAG AACATTTGAA GTAGGGCTAG 
205151 ATTTCCCTTG ACATTGTTAA ATCCAGATCC TTTCCAAAGT AATTTCTTGA 
205201 AAGCTATTTA AGTTGTCCCA CTAGATGAAT ACCTGCTGAT GACTGTTCTA 
205251 TTCACCAACT AATTCCTGAG CTTTTCATTG CTCGATTGCC TATTTAGTTT 
205301 GTGTAATTTA ACACTTTGTT GTCATATTTG TCTTAACTTT TTCATTTTTA 
205351 TTTTCTTCTT ACCAGAGGCA AGCAGAGAGA CAAGGGAGAA TGATTTATGA 
205401 ACCTTTCTAG CTTGAAATTC TTGGCCCTTT TTGTTTCTCT TTGGAATTTG 
205451 AATAATTTGA GATGTATAGA GGCATGAATA AGGGGTGGTT ATACAGATTT 
205501 AATTTTGACT TTTTTGAAAT TCAGTACCAA ATAACATGTT GATATTCTCA 
205551 TGAATTCAAT AATACACTTC AGCAAACACA TTTCGTGTTA CCCTATTTTG 
205601 TTATTTATTG TGTGATGACT TTCTGAAATT GTTATTATAA TAGCTATTTA 
205651 CTTGCAACTT TTGTTTTTTA TTTACTAATA GCTAAGCTAC AGTTAAAAGG 
205701 ACCTGAGATG AAGCTGGGAA ATAAAAATAA CAGTACAAAT TAGCATATTG 
205751 TAAATATCAA ATAGTCTTAT ATATACACAT CTTTGCCCAC TGTATTCCCT 
205801 TGGTTTGACT GCTCTTATTT CTCTGTCTGA TTTACCCTAT TTTTCTACAA 
205851 GATGCAATGG GGCCCTTTCC TCCTTTGAGA AGCCTTCTCT CACCTCCTAG 
205901 ACTGGCTAAA TGACCCATCT CTCTGCTTTT AGAGAGCCTT GTTCCTCTCT 
205951 TCTTACCACA GAACCCCTTG TGCTTTACGC CAGTAGGTGT TCTCCTGCAC 
206001 ACCCAGTTTG GCCTTGGAGA TGTCTTCCTG GGAAGAGTTA CTTCAGTGCC 
206051 TCAGGTATTT GGGTTCTTAT CATTGCTAAA GGGTACATTA CCAGCATTTT 
206101 AGGAGGGAGT TTTTTTCCTC TTCAAATTCC CTGAAATAAT GATGTGTCTT 
206151 AGACCTGTTT TCTTCCATCT ACTAGGTGTC AATCCTTCTA GGTCACGGAT 
206201 ACATTTTTGA TAGATGTCAT AATGTCATTG TACCACCAAC CCCCTTCTAC 
206251 CTGCCACACT TTAGCATCTG TACTTTGGAC CATTTATTTG AAAACAATTC 
206301 CTTGAAGAGT CTCCAGTGGG ACATTTTTCC TTGTGTGTAT GTGTGTGCAT 
206351 GTGCATACAA GTGTGGTTGG CTTGGTGCCA TCTCATATAT ATGGCATTTA 
206401 CATTCTGCAA CATTGCCCTT TAATGAAAAC CCCAGTTCAC GGGAAAACAG 
206451 TTTTTCCTTG TCAGTCTGCT CATAGTTCTC TAACCAGGCC TTTTACATTA 
206501 TTTCTTCTGC ACTTCTTCTC TTCCTGGCCT CCTTCTTTAG ATAATCTACA 
206551 TCCTTGTTCT GTTCTTAAAA CAATAATATT CATCCTTCAA GGCCCAAATC 
206601 AAGCCTCATG CCCTCCAGAA TGCTTTCCTC CTAATAACAT GCATTCTGGA 
206651 CAGCATCTTC CTCAAAATTC CTGCAGCCTA CTTCAGGGTG CCTACTATGA 
206701 GAGAGCCAGT GTAAGAATTG GAAGCAGCGA GATTAGATCC ACTATCTATT 
206751 GGATAGATAT GTAAGTAACC AGATGTTATA GACAGAATAA TTCTCTCCCT 
206801 CACCCCACCA AAGATGCCCA TGTCCTAACC CCTAGAATTT AAGAATATGT 
206851 CATTATGTGG CAAGAGAGAA TTGTTGAGGA TGGATGTGGA GTCAGGATTG 
206901 CTAATTAGCT GACCTAAAAA TAAGCAGATT ATGTTGGATT ATCTGGGTGG 
206951 ACCCAGTGTA ATCACAAGTG TCCTTGAAAG TGGAGGAAAA CCGCAGAAGA 
207001 GGAAGTCAGA CTCAGAATGG GTTTTGAAGA TGCCACACTG GTGGATTTGA 
207051 AGAAGGTTTG AAAGCAGCAT TTAGAAACTG AAAATGCCAA GGAAACGGAG 
207101 TTTCCCCCTA GAGCCACCAA AGAATGCAGC CGTACTGACA CCTTAATTTT 
207151 AGTTGTGTGA AAGTTATTTT GGACTTTATT CTATAGAACT GTAAGATAAT 
207201 AAATTTGTGT TGTGGTAAGC CACTAAATTT GTGGTAGTTC GTTAGAATAG 
207251 CAATAGGAAA CTAACATATC CAGCTTAATG CCCGACACCA TTTAACTCAA • 
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207301 TTATTTTCTA AAATAAGAAG AGATAATATA TATTGTGTAA CTTATCATAA 
207351 GACATTTTAA TTTATGTTAT ATGTAAATGT TACTAAACAT TCTCAGCCAA 
207401 TGAGAACTAG GGAGACAGAG TAAACTGCAC AATTCTAGAC CAATGAACAT 
207451 TAGCAAACTA CACTGTCTCT GCTCTCAGCC AATGAGAGTT CTACTTTGCT 
207501 ATTTTCAGTG AGTGTACTTC GCAGCGTGGT TCACTCCACC AAGAAATTTA 
207551 CGTTTGCCAT TTAGTTTATG CAATATAGAA CAATTTTGAC TCGTAATGGC 
207601 TGGCAAAAGT TTATAGCACT TCTCCTTGCT TCTATACTCA TTCATTTCTT 
207651 AGTAATCACA GTGATCTTTC TATGCCGTAA ATGAGATTAT GTCATACCTC 
207701 ATCTTAATAG CATTCAATGG CTCCCATCTG CCTTCAGAAT AAAGTCTAAA 
207751 TTCTTTAGCA GTGCCTCCTG TAAATAGCTA CCTCCTTAGC CTCGTCTCTT 
207801 ATTATTCCCC AGTGTCAACT GTCTACTTGC AAGTTCCACT GACTGTTTAA 
207851 TACCATCATT CTCCCAGCAA ATACCCACTT GGAATCTGCT TTGTAACTTT 
207901 GCCTGGGATG CTATATTTGA ATAAAATTCA CTCATCAACT AAATACCCCC 
207951 TACTCAATTC CTTTGCCTGT GATCTCCAAT GCTATAGAAA ACCTATGTCT 
208001 TTCTTCTTGG CCTGGTAGAA ACCACCTTGG CCAGGGCATG GCATAGCCAG 
208051 TATATAACTG ATTCAATGAA AATAACATAT TCAGAGCATT ACCTCTAGGG 
208101 GCAACCATGA AAGGGAGAAC TGGCATCAGT GAAGCAGATG GTTTTTGCAT 
208151 TTGCACCTGG GTCAGTCTTA GCTTCTGCAT TGATACTCCA GGTTATATTC 
208201 TCAAGATCTG GATTTCTGAA CACCTTCCCT TTCCTCAGTC TTCTTTCATA 
208251 AGCCTGGCAG CTTCGATTAA TACTTTTGGA AACTCAAAAC AGAGTATCTG 
208301 ATTTTGCTTT AATTTCTGAA AAATGAGAAT AATTTCCCTC ATTCCTAAGA 
208351 GTACCTGGTC ATTTTAAATT TCTCTATGCT CTCCTCTGGC ATAGAATTTT 
208401 ATCTGGAGTC TAGATTTAAA TATGATTGCC TACCTAGGTG TGATTTCCTT 
208451 GAGACTTCCA AGTGGTATTC TAGCACTGCC CTTCCCTGAA GAGAAGGAAC 
208501 TAGGCAATAT TCTACTTAAC TTGGCTGATC TGAGCTACAT AAGCCACCCA 
208551 CAGAAAGGGG TATAGAGAGG TTGCTGCAGT TTGAGCTCAG CAGGACACAT 
208601. TGGCTTGGTC TGCTACATTT TTTCTGATGG CTTGTATTGG CTTCCATTCT 
208651 TCAAGATGGC ATCTCCTTGG AATTTTTATA ATCTTTCTTT GCCTACTCAC 
208701 TAGTCTAATG TCTCTAGACT TAGCAGCCCT TCAGTAGGTA TGAGTCCCCC 
208751 TGTATCATCT GGAAATTGTC ATATGTGAAA GCAAGTGTTA TGTTTAGGAA 
208801 GATTTTCTTC CCCATTAGGC AGGTGCTGAT GAACTGACAG GCTGGGGTTT 
208851 CTCCATTAAG TGTGTGTTTC TTTAAGAGTA ACTAGGGTCC TTTCCATTTC 
2 08901 CCAGCTTCTC TAGGTATAAG TGGGGGAGTG CAATCTTCTC TCAGCCAAAT 
208951 GCAAAGGGAA TCACTTGAGC GCCTCTGGTC AGACTTCCTT GAGGGTGGGT 
209001 GGCTGGCGTG GGCAGGATGA GAGCTATCTA CATGTAGGAG TAAAAATAGT 
209051 TCCTCCTTGT GTCCTGAGGT ATCTGTTGTA GTTGTGTAAC TTTCTGCCTG 
209101 TATGTGTTTC CCTTCTTGTA TTGATAAGCT TATGAATTAA TTCTTCGGTA 
209151 CAATTTTGAT ACAAGTAGTA CAAGCATGCT GAACTCACCA CCCAACCTAA 
209201 ACACTCTCTA TACTGTTGAA GAGCATGACT TTGACGCTGA ACACCATTCT 
209251 GCCTCATGGA CAGATACTGA TTGGATGTTT ATTTCACTGG ATAGATTCCA 
209301 CCAATCACAA TCTGTTGCTG AACCCTAGCA CAAGGGTCTA GAGTTTCTTA 
209351 CGTACTACCT GCTTTGTGGT TTACTTGCTC TAACTGGGGC AGAATGAAAC 
209401 CTGCAGTTGT TAGAAGCCAG CTTGGAATGG CAGGAGATAC CAACATTGCT 
209451 AAAAACAGCC ATCACTTAAA TGGTCAGCCA GATTAATGCA GTATATATAG 
209501 CCACAGCTTA TAAAGACTGT CTTTGGTTAT GAATTATTTA AATCATTTCC 
209551 TAGAATATTG AGGTAGATTT TTGACTCAAG CCATTGCAAT ACAACTTTTA 
209601 TGAGCCTTCT TCCACATGCA AAAAAATGGT TGATGTTCTC AAAAGCCTAT 
209651 AGGCTGGTAA GGCAAAGACT ACTTTCTTAC TGAATGTAGC TAAGTTTCTT 
209701 ATCTATGGCC TACCATATGG AGAGCATTAG ACAACTTTTG CATAAATCTC 
209751 CTTTTTTCTT AATGGAAGTT GAATATTGGC CTTGGGGGAT CCCTGACAAA 
209801 TCAGGCATCA TCTGTTTATA CCATTCTTCT GAATGGAACT CTTGGACTTG 
209851 AGCTGTTTGG TTCCACAATG AGTTGCAGGT GCTGAGAGGG ATTGATGACG 
209901 CTCTACTTAA TTGCTTGTCA CACAGATAAT CTGATGAGGG GGCATCAGTG 
209951 CCAGCAGCTG GCAGCTTTCT CACCCATGAC CTGAGGCAGA TTCTTGGGTT 
210001 AAGAAGGTTG TGAGATAAT^A CAGTAAGAAT TCCTCCCTTA TAAATGTTTC 
210051 TAAGGACATT AGTTGGGGCG GGCGGTGTTT CTGGCTTAAT TTTGAGAATA 
210101 CAATGAGAAC AAAATATGTA TTTCAAAGAT CATAGGTTTT GGTATAGGAG 
210151 AGTCCTAGGT GAGCATCCTA GTTTATTTTC ATTCATTGTG TGCCCTCAGG 
210201 CAAGCTATTA AACTTTTCTG AGTCTTGGTT TCCTCATTAA TAAAATGAGA 
210251 ATAAAAATAT CTAATGTTAC AGGATTGTTA GAACTCGTGT GAACCTAAGG 
210301 GTAACACTAA GTGTGGTGGG GGCCAGTGCT TCTCAAAAGG TGAGGCTGGA 
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210351 CCAGCAACAT CAGCATTGCC AGGGAGTTTG CTAGAAAACA CACATTTTTG 
210401 ACTCTCTCCC ACTCCTCCTC AGACCTGCTG AAAGAGAAAC CCTGAGGCCC 
210451 AGCAGTCTGG TTTAACAAGC TCTCCAGGTA ATTGAACTAA AGTTGGAAAA 
210501 TCCTGATGTA GGCTTTCAAA AAGTGATTGT TAAGGTGAAC AGAAGGGAGC 
210551 CAATATTCAA AATGCTAAGT GATCACAAGA AGTCAGAAGC CAAGGTCTAG 
210601 CCCCAGAAAG GGTGAGAAGA GCTGATGAGC AGGAAAGACT GGGGAAAACC 
210651 CATCCTGTAA GGAGCACAGT GGGATTCTGA GGACACAGTT GAGTGTTCCA 
210701 GACAGGACAG CCTCTGGAGC TCCAACAGGG TTGCAGGCCA GCCTGTGAAC 
210751 CCTTGTTCCC TTTCTTGCTC CTCAGTGAAG CCTAAGTTTA CCTTCCTTCA 
210801 TGTCTGACTG CTTGCAAATC TATCGAATCT TGTGGAAACT GTTTTTTTCC 
210851 CCCAAATAAA TAATATTTAG ACTTAAAGTA TTTAGAATCT AAACATAAAC 
210901 TTTTACAATG AATTAAGTGT ATAAGATCCT ATTGAAAATA ATGGAAAATA 
210951 TTTTCTATTC ACATAGGTTA TCCGTGAAAA AGAGTAATTG CTATGTTATA 
211001 ATTACGAGCA GTATAAATAG ACCTGCTTTT ATGCTTTTAA TGAGGGTTTT 
211051 GAATATATTT GACCAAGGTA TTAGGAATTA TAATACAGCA ACCTTTTCAA 
211101 TATCAGTAAC AATATCTAAA TCTTATATTG CTTTTTACAG ATTCCAATGT 
211151 TTTCATGTTT ATTCACTCAA TCTGATTTCT ACACACAGCA GATCAAAAAA 
211201 AGTTGTGAGT TCAGCTTTCT CCACTGTTTG GAGGATACAG CTTTAGAGAG 
211251 GTGTGAAGCT GAGACCATTG GCATTTGGAC ATCAGTGGAC ACTGTGAGGT 
211301 GCTGCATCAT TTTAAGACTG GAGTCCACAG CACTTGTGAG GAGGAGGATG 
211351 ACAAACATCA TCTGTCTAGG CTCAGTCTGG AAGAATTTCT TCAACACTCA 
211401 GCTGGGGCGC CATCCACAAT CTCCTGGAAA CGCTCCAACC CTGTAATTAT 
211451 ATTCTGCCTC TCAAAATCAG ACTTCGTTTT CTAATTCAAT AATGCTCCAT 
211501 TTTATCATGC TCATTTTAGA TACTTTCGTC TCTCATTTCA AGAGTTTAAA 
211551 TAACAGACAA CTATCACGAC TTTGGGAGGA GGTAGGTGAG ACAGAGCAGT 
211601 GCTCCAGAAT TAGGTCCCAC CCCCGAGGCC TCTCTCAGCA ATCAGGCTAA 
211651 GACACCCCAC AGGCTGTCTC TTTCTGCCTC CCCTTTCCCA CAACACAATC 
211701 ACCAGCTTCT CCAATGCATC ACCTGGCCTG AATGCATCGC CTGGAGCATT 
211751 GCCTGGAGCA TTGCCTGGCC CGAAGGGCAG CACCACCCCA TGGGACCATC 
211801 CTGGATGATT AGGAAAGAAG AAACTATTTC TGCCAGACTT TCTTTGTAGT 
211851 TTCGTTTCAG AAACCTCATT TAGTTCTTCA GCTGTTTTAA GCAATGTTTC 
211901 TGAAAGAAAA AACTATCATT TAGGAGAATC GTTTGTTGAT GCTCATATGT 
211951 TGCCTTTGAG TGTTACCCCT TCACAGTTGC TATGGGGAGA TACAAGCAAA 
212001 ACAAAACAAA ACAAAACAAT ACAAAACAAA ACCCAGTAGA TTTAGTTCCT 
212051 CTTATTACTC ATTCCATTAA TAACCCTTGA GTACTTATTA TATACAAGGC 
212101 AATGTAGAAT ATGAATCCAT GAATCACATC TAGAATTTGT CTATAAGAAA 
212151 CATGGGCTAT AAAAATAGGC TGTGTTTACT AGGCTGATAA CAAATCCTTT 
212201 CTAAAACTTA TCCTACATGA GGTAAAATCA ATGCAGAGGT CAAGAATCTC 
212251 AAGCGGTATA CAGTCTAGGT GAGGCCATAA AGATGAATCT CAGGACAGAC 
212301 CTGCTTCAAC TGAACTAGAC ATGTTACTAT TAGAATAGTT CCACTGGAGA 
212351 ACTGGTTGTT TAAAGACAGT TCAGACACAA ATCTTTTGCT ATACAAAGAA 
212401 AAGAAGACCC CAGATTTAGA GTTACGTCTA TTGTTAAGTG CAGAACAGCT 
212451 GCATGCTATG TAAGGGTTGT TCACTTTCAA TCCCCAGGGC ACCATTTACA 
212501 TAGACTGGGA CATGGCCTGC- AACGTGCAGT CTATGCAATT GCATGCAATA 
212551 GCTCTATATG AATGTTTCTG GAAAAAAATA GACTTTATCT GAAAGGATGA 
212601 CTTACACCTC CTAATTTAAC TATTCACATT TTGGGATTTG CTTGGAGCCA 
212651 AACCAGATTT CTGGCTTTTT GCTGCCAAAC CAGATTTTAG GAAAATCCTG 
212701 CAAATATGGG TTTGACCAAA GTTTTGTACT CTTAACCTCC TGCTAAAATG 
212751 GTCTAGTCCC CACTTTTAGT AACAATCTCA CCCCCGCCAT CACTCTAAAT 
212801 ATCCTAACTG GGAAAGCTAC CTATGAACTT TTCTCTCTGG TCAATGATTA 
212851 TGATCTTTTC CCTTTTTTCT TTGCAAAACC TTTCTGTTTT CTGTTCTCTG 
212901 CTTGAAAACT GAAAATATAG TTTATATTCT GGGTTGAAGA TGTTAATGTG 
212951 CTTTATAATT ACTTCAGATA ATATACATGC CTTGGACAAA TGACTGTTTA 
213001 ACTCCCTGTC CTCAGGTTGT GGTTTTTGGC AAGAGCAATT TTTTTTTCAT 
213051 AACCTTTGTC TTCTGACCCT CCATTAATCC ACAGTTTACA CCTTCCCTTC 
213101 CTTACCCTTA GGTCTTTTCT TTCTTTTTTT TTTTTTTAAT TATACTTTAA 
213151 GTTCTAGGGT ACATGTGCAT AATGTGCAGG TTTGTTACAT ATGTATACAT 
213201 GTGCCATGTT GTTGTGCTGC ACCCATTAAC TCGTCATTTA CATTAGGTAT 
213251 ATCTCCTATT GCTATCCCTC CCCCCTCCCC CCACCCCACA ACAGGCCCCG 
213301 GTGTGTGATG TTCCCCTTCC TGTGTCCATG TGTTCTCATC GTTCAATTCC 
213351 CACCTATGAG TGAGAACATG CGGTGTTTGG TTTTTTGTCC TTGTGATAGT 
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213401 TTGCTGAGAA TGATGGTTTC CAGCTTCATC CATGTCCCTA CAAAGGACAT 
213451 GAACTCATCC TTTTTCATGG CTGCATAGTA TTCCATGGTG TACAGGTGCC 
213501 ACATTTTCTT AATCCAGTCC ATCATTGATG GACATTTGGA TTGGTTCCGT 
213551 CTTTGCTATT GTGAATAGTG ACACAATGAA CAGCGTGCAT GTGTCTTTAT 
213601 AGCAGCATGA TTTATAATAC CCTTACATCT TTTCTCACTC TAACTTCCCC 
213651 AGGACTTCCT CTCAGTACAC ATCTGTAAGT TTAAGTCATT CCTACTTCAT 
213701 AQCCCTTCAT AATCCTTCAT CCACAGGTTG GATGAAAGTA TAGATTGGTA 
213751 AAAGGAGACA GGAGACTCCG TGATGGCCTT TGGAACTCCA GGTCCTGTGG 
213801 AGTAGAGGGA GAGGAGTAGA GATGAAGCTG AGGTCTTCAA ATGCAGATGG 
213851 GGTGGAGGTA ATACTTCAAG TGCTGTGATG CATTCTCATG AAAGTAAGAT 
213901 GTAGAAACTC TCTTGATGAC TTCAACTTTG AAATGCAACT TTTCTATCAG 
213951 ATGTGTATAT ATACTCTTTA TGCACGTAAA TACTCATTTC CCAAAGGAGA 
214001 CGTTAATCTT TGTCTAAGCC CTAAAGATGG AGATACTTTT CTTATTTATT 
214051 TACTTATTTA TTTTTTAATT TACTTCAAGT TCTGGATACA TGTGCAGAAC 
214101 GTGCAGGTTT GTTATATAGG TATACATGTG CCATGGTGGT TTGCAGCACC 
214151 TATTAATCCA TCATCTAGGT TTTAAGCCCC ACATGCATTA GGTATTTGTC 
214201 CTAATACTCT CCCTCCCCAT ACCCCACGAC CCCCGACAGA CCCCAGTGTG 
214251 TGATGTTCCC CACCCTGTGT CCATGTGTTC TCATTGTTCA GCTCCCACTT 
214301 ATGAGTATTT TATTCTCTTT GTAGTAATTG TGAATGGGAG TTAACTCATG 
214351 ATTTGGCTCT CTGTTTTTCT ATCATTGGTG TATAGGAATG CTTGTGATTT 
214401 TTGCACATTG ATTTTGTATC CTGAGACTTT GCTGAAGTTG CTTTTCAGCT 
214451 TAAGGAGATT TTGGGCTGCA GTTTTTGGTT TTGTGTCCTT GTGATAGTTT 
214501 GCTTAGACTG ATGGTTTCCA GCTTCATCCA TGTCCCTGCA AAGGACATGA 
214551 ACTCATCCTT TTTTTATGGC TGCATAGTAT TCCATGGTGT ATATGTGCCA 
214601 CATTTTCTCG ATCCAGTCTA TCATTGATGG ATATATGGGC TGGTTCCAAC 
214651 TCTTTGCTAT TGTGAATAGT GCTGCAATAA ACATACGTGT GCATGTGTCT 
214701 TTATTGTAGA ATAATTTATA GTCCTTTGGG TATATGCTCA GTAATGGGAT 
214751 TGCTGGGGCA AATGGTATTT CTAGTTCTAG ATCCTTGAGG AATTGCCATA 
214801 CCATCTTAAT AAATTTGTTA TTTGTAGATT CTGGATATTA GCCCTTTGTC 
214851 AGATGGGTAG ATTGCAAAAA ATTTCTCCCA TTCTGTAGGT TGCCTGTGCA 
214901 CTCTGATGAT AGTTCCTTTT ACTCTACAGA AGATCTTTAG TTTAATTAAA 
214951 TCCCATTTGT ATATTTTGGC TTTTCTTGCC ATTGCTTTTG GTGTTTTAGT 
215001 CATGAGGTCT TTGCCCATGC CTATGTCCTG AATGGTATTG CCTAGGTTTT 
215051 TTTCTAGGGT TTTTATGGTG TTAGGTCTTA CATTTAAGTC TTGAATCCAT 
215101 CTTGAGTTAA TTTTTGTATA TGAGGTAAGG AAGGGATCCA GTTTCAGCTT 
215151 TCTGCATATG GCTAGCCAGT TTTCCCAGCA CCATTTATTA AATAGGGAAT 
215201 CCTCTCCCCA TTGCTTGTTT TTGTCAGGTT TGTCAAAGAT CAGATGGTTG 
215251 TAGGTGTGTG GTGTTATTTC TGAGGCCTCT GTTCTGTTTC ATTGGTCTAT 
215301 ACCTCTGTTT TGGTACCAGT ACCATGCTGT TTTGGTTACT ATAGCCTTGT 
215351 AGTATAGTTT GAAGTCAGTT AGCATGATGC CTCCAGCTTT GTTCTTTTTG 
215401 CTTAGGATTG TCTTGGTTAT TCGGGCTCTT TCTTGGTTCC ATATGAACTT 
215451 TAAAGTAGTT TTTTCCAATT CTGTGAAGAA AGTCTTTCGT AGCTTAATGG 
215501 GGATAGCATT GAATCTCTAA ATTACCTTGG GCAGTATGGC CATTTTCACA 
215551 ATATTAATTC TTCCTATTCA TGAGCATGGA ATGTTCTTCC ATTTGTTTGT 
215601 GTCCTCTTTT ATTTCATTGA GCAGTGGTTT ATAGTTCTCC TTGAAGAGAT 
215651 CCTTCACATC CCTTATAAGC TGGATTCCTA GGTATTTTAT TCTCTTTGTA 
215701 GTAATTGTGA AAGGGAGTTC ACTCATGATT TGGCTCTCTG TTTGTCTATT 
215751 ATTGGTGTAT AGGAATGCTT GTGATTTTTG CACATTGATT TTGTATCCTG 
215801 AGACTTTGCT GAAGTTGCTT ATCAGCTTAA GGAGATTTTG GGCTGAGATG 
215851 ATGGCGTTTT CCAAATAAAC AATCATGTCA TCTGCAAATA GGGACAATTT 
215901 GACTTCCTCT TTTCCTAATT GAATACCCTT TATTTCCTTC TCCTGCCTGA 
215951 TTGCCCTGGC CAGAACTTCC AACACTATGT TGAATAGGAG TGGTGAGAGA 
216001 GGGCATCCCT GTCTTGTGCC GGTTTTCAAA GAGAATGCTT CCAGTTTTTG 
216051 CCTGTTCAGT ATGATATTGG CTGAGGGTTT GTCATAAATA GCTCTTATTA 
216101 TTTGGAGATA TGTTCCGTCA ATACCTAGTT TATTGAGAGT TTTTAGCATG 
216151 AAGCGCTGTT GAATTTTGTT GAAGGCCTTT TCAGCGTCTA TTGAGATAAT 
216201 CAATTGGTTC TGTTTATATG CTGGATTACG TTTATTGATT TGTGTATGTT 
216251 GAACCAGGCT TGCATCCCAG GGATGAAGCC CACTTGATCA TGGTGGATAA 
216301 GTTTTTTGAT GTGCTGCTGG ATTTGGTTTG CCAGTATTTT ACTGTGGATT 
216351 TTTGCATCGA TGTTCATCGG GGATATTGGC CTAAAATTCT CTTTTTTTGT 
216401 TGTGTTTCTA CCAAGCTTTG TTATCAGGAT GATGCTAGCC TCATAAAATC 
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216451 AGTTAGGGAG GAATCCCTCT TCTTCTATTG ATTGAAATAG TTTCAGAAGA 
216501 AATGGTACCA GCTCCTGTTT GTACCTCTGG TGGAATTCAG CTGTGAATCC 
216551 ATCTGGTCCT GGACTTTTTT TAGTTGGTAG ACTATTAATT GTTGCCTGAA 
216601 TTTCAGAACC TGTTATTGGT CTATTCAGAG ATTCAACTTC TTCCTGGTTT 
216651 AGTCTTGGGA GGGTGTATGT GTCCAGGAAT TTATCCATTT CTTCTAGATT 
216701 TTCTAGTTTA TTTGTGTAGA GGTCTTTATA GTATTCTCTG ATGGTAGTTT 
216751 GTATTTCTGT GGGATCGGTG GTGATATCCC CTTTATCATT TTTTATTGTG 
216801 TTTATTTGAA TCTTCTCTCT TTTCTTCTTT GTTAGTCTTG CTAGCGGTCT 
216851 GTCTATTTTG TTGATCTTTT CAAAAAACCA GCTCCTGGAT TCATTGATTT 
216901 TTTGAAGGGA TTTTTGTGTC TCTATCTCCT TCAGTTCTGC TCFTAGTTAT 
216951 TTCTTGCCTT CTGCTAGCTT TCGAATTTGT TTGCTCTTCC TTTTCTAGTT 
217001 CTTTTAATTG TGATGTTAGG GTGTCAATTT TAGATCTTTC CTGCTTTCTC 
217051 TTGTGGGCAT TCAGTCCTAT AAATTTCCCT CTACACACTG CTTTGAATGT 
217101 GTCCCAGAGA TTCTGGCACA TTGTGTCTCT GTTCTCATTG GTTTCAAAGA 
217151 ACATCTTTAT TTCTGCCTTC TTTTCCTTAT TTACCCAGTA GTCACTCAGG 
217201 AGTAGGTTGT TCAGTTTCCA TGTAGTTGTG TGGTTTTGAG TGAGTTTCTT 
217251 AATCCTGAGT TCTAATTTGA TTGCACTGTG GTCTGAGAGA TAGTTTGTTG 
217301 TGATTTCTGT TCTTTTACAT TTGCTGAGGA GTGTTTTACT ACCAATCATG 
217351 TGGTCAATTT TAGAATAAGT GCGATGTGGT GTGAGAAGAG TGTATATTCT 
217401 ATTGATTTGG GGTGTAGATT TCTGTAGATG TCTATTAGGT CCACTTGGTG 
217451 CAGAGCTGAG TTCGAGTCCT GGATATCCTT GTTAGCCTTC TGTCTCGTTG 
217501 ATCTGTCTAA TGTTGACAGT GGGGTATTAA AGTCTCCCAT TATTATTATA 
217551 TGGGAGTCTA AGTCTCTTTG TAGGTCTCTA AGGACTTGCT TTATGAATCT 
217601 GGGTGCTCCT GTATTGGGTG CATATATATT TAGGATAGTT AGCTCTTGTT 
217651 GAATCGATCC CTTTACCATT ATATAGTGGT CTTCTTTGTC TCTTTTGATC 
217701 TTTGTTGTTT TGAAGTCTAT TTTATCAGTG AGTAGGATTC CAACCCCTGC 
217751 TTTTTTTCTT TTTGCTTTCC ATTTTCTTGG TAGATCTTCC ACCATCCCTT 
217801 TATTTTGAAA CTATGTGCAT CTGTGGACAT GAGATGGGCC TCCTGAATAC 
217851 AGCATACTGA TGGGTCTTGA GTCTCTATCC AATTTGCCAT TCTGTGTCTT 
217901 TTAACTGGGG CATTTAGCCC ATTTACATTT AAGGTTAATA TTGTTATATT 
217951 TGATCCTGTC ATTATGATAT TAGCTGGTTA TTTTGCCCGT TAATTGATGC 
218001 AGTTTCTTCA TAGAGTCAAT GGTCTTTACC ATTTGGCATG TTTTTGCAGT 
218051 GTCTGGTACC AGTTGTTCCT TTCCATGTTT AGTGATTCCT TCAGGAGCTC 
218101 TTGCAAGGCA GGTCTGGTGG TGACAAAATC TCTGAGCATT TGCTTATCTG 
218151 TAAAGGATTT TATTTCTCCT TCATTTATGA AGCTTAGTTT GGCTGGATAT 
218201 GAGATTCTGG GTTGAAAATT CTTTTCTTTA AGAATGTTGA ATATCGGCCC 
218251 CCACTCTCTT CTGGCTTGTA GGGTTTCTGC TGAGAGATCC ACTGTTAGTC 
218301 TGATGGGCTT CCCTTTGTGG ATAACCTGAC CTTTCTCTCT GGCTGCCCTT 
218351 AACATTTTTT CCTTCATTTT AACCTTGGTG ' AATCTGACAA TTATGTGTCT 
218401 TGAGGTTGCT CTTCTCAAGG^ AGTGTCTTTG TGGCATTCTC TGTATTTCGT 
218451 GAATTTGAAT GTTGGCCTGCf CTTGCTAGGT TAGGGAAGTT CTCCTGGATA 
218501 ATATCC^TGAA GAGTGTTGTC TAACTTGGTT CCATTCTCCG TGTCGCTTTC 
218551 CAGTACACCA ATCAAACTTA TATTTGGTCT TTTCAAATAG TCCCATATTT 
218601 CTTGGAGGCT TTGTTCATTT CTTTTCACCC TTTTTTCTCT AATCTTGTCT 
218651 TCTCACTTTA TTTCATTAAT TTGGTCTTCA ATCACTGATA TCCTTTCTTC 
218701 CACTTGATTG AATCAGCTAT TGAAGGTTGT ACATTTGTCA TGAAGTTCTC 
218751 ATGCCTTGGT TTTCAGCTCC ATTAGGTCAT CTAAGGTCTT CTCTACACTA 
218801 TTTATTCTAG TTAGCCATTC ATCTAACAGT TTTTCAAGGT TTTTAGCTTC 
218851 CTTGTGATGG ATTAGAACAT GCTCCTTTAG CTTGGAGAAC TTTGTTATTA 
218901 CCGACCTTCT GAAGCCTACT TCTGTCAACT CGTGAAACTC ATTCTCCATC 
218951 TAGTCTTGTT CCCTTGTTGG CGAGGAGCTG TGATCCTTTA GAGGAGAAGA 
219001 GGTGCTCTGT TTTTGGGAAT TTTCAGCTTT TCTGCTCTGG TTTCTCCCCA 
219051 TCTTTGTGGT TTTATCTACC TTTGGTCTTC GATGTTGGTG ACCTACAGAT 
219101 GGGGTTTTGG TGTGGATGTC CTTCTTGTTG ATGTTGATGC TATTCCTTTC 
219151 TGTTTGTTAG TTTTCCTTCT AACAGTCAGA TCCCTTAGCT ACAGGTCTGT 
219201 TGGGGTTTGC TGGAGGTCCA CTCCTGACCC TGTTTGCCTG GGTATCACCA 
219251 GGGGAGGCTG CAGAACAGCA AATCAGGCTG CAGAACAGCC TGATCCTTCC 
219301 TCTGGAAACT TCGTACCAGA AGGGCATCTG CCTGTTTGAG TGTCTGTTGG 
219351 CACCTACTGG GAGGTGTTTC CCAGTGAAGC TACACGGGGG TCAGGGACCT 
219401 GCTTGAGGAG GCAGTCTGTC CTTTCTCGGA ACTCAAACGC CATGCTGAAA 
219451 GAACCATTGC TCTCTTCAGA GCTGTCAGAC AGGGACATTT AAGTCTGCAG 
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219501 AAGCTGTCTG CTGCCTTTTG TTTACTATGC CCTGCCCCCA GAGGTGGAAT . 
219551 CTATAGAGGC ATTAGGCTTT GCTGAGCTGC AGTGGGCTCT GCCCAGTTCC 
219601 TGCTTCCCAG CCTCTTTGTT TATACTGTTG AGCTACTCAA GCCTCAGCAA 
219651 TGGTGGACAA CCCCCGCCAT CCAGCTGCAG CATCGCAGGT CGATCTCAGA 
219701 CTGCTGCGCT AGCAGTGAGC AAGGCTCCAT GGTCGTGGGA CTGGCTGAGC 
219751 CAGGCACGGG AGGGTATCTC CTGGTCGGCC GGTTGCTAAT ACCGTGGGAA 
219801 TAGCGCAGTA TTTGGTCAGG AGTGAACTGT TTCTGCAGAT ACAGTCTGTC 
219851 ATGGCTTCCC TTGGCTAGGA AAGGAAAATC CCCCAACCCC TTGAGCTTCC 
219901 TGGGTGAGGC GACGCCCCTC CCTGCTTTGG CTTACCCTCC ATGGGCTGCA 
219951 TCCACTGTCC AACCAGTCCC AATGAGATGA ACCCAGTACC TCAGCTGGAA 
220001 ATGCAGAAAT CACCCACCTT CTGCATCAAT CTCACTGGGA GCTGCAGACC 
220051 GGAGCTGTTC CTATTCAGCC ATCTTGGAAG TGACTACCAT GGTCTTCTTT 
220101 ATTGACTGTG GAACTTACCC TTAGAACAGT TTCTCAAATC ATCTTTTGAT 
220151 TGATTGCATT TGTGCATCTG TTTGTAATTT TTATTTTTAT TTATTTTTTA 
220201 TTTTTTTGAG ATGGAGTCTC ACTCAGGCCC TGTTGCCCAG GCTGGACTGC 
220251 AGTGGCGCGA TCTTGGCTTA CTGCAAACTC TGCCTCCCAG ATTCAAGCAA 
220301 GTCTCCTACC TCAACCACCT AAGTAGCTGG GATTACAGGT GTGCACCACC 
220351 ATGCCCGGCT AACTTTTGTA TTTTTAGTAG AGACAAGGTT TCACCACGTT 
220401 GGCCAGGCTG GTCTCAAACT CCTGACCTCA AGTGATCCAC CCACCTCGAC 
220451 CTCCAAAAAT ACTGGGATTA CAGGCATGAG CCACTGCTCC CAGCTTGTTT 
220501 GTGAATTTTT TTTAAAAGAG GGACTCTCAA CCACTTTTAA CTGAACTATT 
220551 TTGTATTTGA GAATGCCAAA CTGTTGCCTT TAAATTTGAG AAACATTGGT 
220601 GACTATTTTC CCCCCAAAAC TCAGAGCCTT TGCCTTCAAG TAAAAGATGG 
220651 TTTCATTTTT CCTGTTACAG TGATTATATA TATATATATT TATTGATTTC 
220701 CGGATGTTTT ATGATTATTA TCTCCTTAAT TAATATTCTC ACAATAATTC 
220751 TGCATGGACT ATACTGTCTT TCCTATTTCA TTGGTCAAGA AGTTGAAGGT 
220801 AAGAGAGGTT AATAATTCAT AAGGTCATAC ACATATAAGT AGTGAAGTCT 
220851 TGATGTGTGC ATAGGACCCC TTGTTTCCAG TGTCTTCCCT CTCTCTTCTC 
220901 TGCCATATGT CTTCTCCTGG ATGTAGAGAA TCTGGGAATG AGGTAAAAAT • 
220951 AAGGCTATGT GAGTTGTGGT CAGTTTGGGG GATCAGAAAG AAAAAGCTGA 
221001 TTTTTGCCTT CTAAATTCTA TTACCAGACA TGTAAGAAAA TACTATTTCC 
221051 AAGAATCCTG AAATCAAAAG TAAATAATGC CATTGAAATT AGAGAATATT 
221101 TCTTAGCCCT ATGCTCTACG GGAAGATTTC TTTTGAGGTT GACAGTTTCT 
221151 TGGAATTCTG ATGGCTGGGC ATAATGCACT TTGCTCTGTC ATGTTACTGT 
221201 AAAAAATGTG TTCTGTTTAG GAAAATTAAA GGCATTACAT CAAGATCCAT 
221251 AGTCTTATGT CAGCTCCATT ATCTTGAATC ATATTTAATA GCAACATGAT 
221301 ATTTCTGGTT ATGAACTCTC CTTTGGTTTG GTGTTTAAAG TGTGTCTTAA 
221351 GCCTAATCAA TTCTTGTGAC TGTATGTTCT ACAAAGTCAG GTTTCTTGAG 
221401 TCAAATGGTG AAGCATTGAG CATCTAAGAG CAGAGTCTAG AATAGCTAGA 
221451 ATAATGATCA CACACACACA TGTGTGTGTG TATATATATA TATATATATA 
221501 ATCTGATAAG AATTAATGGT TCATAGATCA CAAATGGAAT ACATAACCCT 
221551 GGTGATCATT TCATTACTTG TGGTCTCCAG AAAATGCAGA GAAGGGATGT 
221601 TAAATTATAT GATGTATGAA AACCAAAAGG ATCTCTTTTG CACGGAAAAC 
221651 TCAAGTTGCA GGTTCTGCCT CTGAACTCTA CACGTTCAGA AAGGGTGGCC 
221701 TGGGTCTTCA GAGTGAGTTT GAAGGCAGCC TGACATGCCA TGTCCCTACG 
221751 GGACCTTCCA CAGCACGCAT CCTCTCTATT ACTTCTTGGC ACCTCATAAA 
221801 TTGGTGACAT TAATTATTTA ATTGATTAAC TTCTTACTTA TTAATCTCTA 
221851 CGAATAGAAT TTTGCTTTAA AAGGGACCGG TATACCCTAT ATATTGTATA 
221901 GTTCTTGGCA CATATAGGCA TTCAAAAATA TTTGTAAAAC ATTATGTTTT 
221951 CTGTAAGTCT TGTCTTCTGA ACTAAGTTAT AAACTACCAA AGAACAGAGT 
222001 TTATAGTTGA TACTCTGAAT GGCCCATTGG ACACAATAGT GCTGAGCTGA 
222051 GAGCAGGATC TCAAGGGCAA TTTGTTATAC TGAGTTGAAT TATACCATAT 
222101 GGAACTACCC ACATGAAGGT CTGCCTTCCT GGCTTCTGGC TCTGTGCTTC 
222151 CATAGTTCTT TGTACATGCT TCTTTTATAG CCTCTAATCit ACCCTTTTTT 
222201 AATTTTTTGT TCCTACATCT GACTGCCTCT CAATATTTTG TGCTAAGAGC 
222251 ATTTATATCT CCTTTCTTCG TTGATAATTT TGCACATTGC AAATGATGAA 
222301 TAAGAAAATG GAACTAACAT TATATTAATT GGCTGCTATG TCAGGCATGC 
222351 GGCTATACTA TGGACTTTAC AGATGTCAAT TCAATTAATC CTCAAAATCC 
222401 TATGAAGTTG GTAGTTTAAT CTTCATTTAA TAGTTAAAAA AACTGAAACT 
222451 CAGGAGTTGC TTGCCCAACA TCACGTATCT GGCAAGGATC AAAGCAGAGA 
222501 TGGGAGCCCA GTTTTTTTCA ATTCTGAAGA CTTTACTGAC TCTTCTACTG 
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222551 CGCTAATCCG TTTTTGCATT AATTAATGAG TTAACAAATA GCAACAGCAG 
222601 TGGTAGGTAG CAACTAAAGC TCTTGTGTTC ATTAACAATG GAGCTGGAAT 
222651 ACCCAATATG TTCTTACAAG GGAATCGTTT TCATTTCAAA CTGATATCTC 
222701 TAAGCAATAC CCTTGGAATC ATTTGTGTTG CCTCCCTTAG AATAGTCATG 
222751 TTGCCTTTGA ATAACAGGAG GTCCTTCTCC CATCTTGTTG AGGGAAGCTA 
222801 CATCTCAACA AGGTGGGAGA AGGACCTCCT GGAGTCACTG ATGATAACGT 
222851 TCTTTGGGAG GTTTAGTGAG AAGCTTCATT GCTGAGGGAG AGGGGGTGTC 
222901 ATTAATTCTT GGGTTTTAGG AGTTTAAAGG AGGTATTATT CCAGAGAGGG 
222951 ATGGGGAAGA GTCAGACGTG AGGCTGTCCA GTGATGGGTG CTTTACAACA 
223001 GCAGTCTTTA GACTTGGTCA GAGTTACTTG TGGACATACT TTTTAAAAAA 
223051 TCAGGTCCAG . AACCTTAATT TTCATATTTC GTATTTACCT GAAATTTATC 
223101 TCCCTGATAA CTTGCTTGCA TTTAAAACAG AGCTTTTTCT ACTTTAAAAA 
223151 AGAAAGATGT TTCTTCATCC ATCTCAAATC TTACAAAAGT GCACATCCTC 
223201 CCAGACATCT AAGGGGAAGG TTTCTTTCAT AAGTAAGTTG CCATATATCC 
223251 TCAGGGCTTT TAGTCTTTCA TGCATTAAAA ATGTGTTAAG GTTGATGTAC 
2233 01 TTGTCAGAAA GAGAGTATTT GGGTGCTGTA TTGGGATTTC TGATTTCATA 
223351 TCTATGGTAG AGTAAGAGAG TGGACTAATG ACAATATTCA CTTAGTCATG 
223401 TATCCTCTTT CACCCATCAC CTATTGTCAA GTAATGCATT TCTGTTAAGG 
223451 TAGTGTATTA GGCCTTTCTT GTATTCCCAT AAAGAAATTC TTGAGAGTGG 
223501 GTAATTTATA AAGAAAAGAC ATTTAATTGG CTCATGGTTC TGCTGGTTTT 
223551 ACAGGAAGCA TGATGCTGGT ATCTGCTTGG CTTCCAGTGA AGCCTCAGAG 
223601 AGCTTTCAGT CATGCAGAGC TGAAGCAGGA GTAGGCACTT CACATGGCAA 
223651 AAGCAGGAGC AAGCAAGAGA GAGTGGGAGA GATGTGCCCC CCCCGCCACC 
223701 CAACTTTTAA ATGACCAGAT CTCATGAGAA TTCACTATCA CAAGGACGGA 
223751 ACCAAGCCGT GAAGTATTCA CTCCCATGAT CTAAATACCT CCCACCAGGC 
223801 CCCACCCCCA GCATTGAAGA TACAATTCAA CATGAGATTT GGACAAATAT 
223851 CTAAACTACA TCAGGCAGAG TCCTGTAAGA GTGAAGAAAA AGACTCAGTA 
223901 AGAAACACTG AATATGAAAA CTGGCTTTTT CCCCATTTTT TTCCCAGCTC 
223951 TGGTTAATAC TTATTTTTAT GACCTCCATT GTGCTAAAAA GAAAAAGATT 
224001 TTCAAACAGC CAAAGAATTA GAGGTTCTTT TAAGGAATTT GGAAAGTGTT 
224051 CTAGAATTAT CAAAAAAAAA AGGTAGGTGA TATTCAATTA AATCAACCAA 
224101 TTTAAAAAAT TACATATGTT TTCATGTTGG TGTAATGTAA AAGCTCTGAA 
224151 GCTTATTTTT GAAATCAAAT TACAACATTT TCTTCCAATT GCAGGTAATC 
224201 CTAATTTGGT TTAAATTTTC TACCTGCTAT TGACAAGAGG TAAATTTATG 
224251 TGCCAGGTGC CAGACCTATC AGTTTTTAGT AGTATATATC TTTCTTTATC 
224301 ATAAAGTATG CCTCAGACTA ATCTATCTTT CATAATGTGA TCTGCAATTT 
224351 ACTCAGTTTT TACAATATTC CTCTGAATAC ATATTAACAT ATTTTTGAGT 
224401 TAAAAATTAT AGTTGGTTTG TGGATTGGTT TGAAAATGAG GGCTGGCTTA 
224451 GCCAACAAGG TTATATGTTA GTTTATCCAA TATAGTTAGG ATATATTAAT 
224501 TTATGCCCAT GAAGTGACTG AGCCAAATCA TCAGTGCCAA GGTTTATACA 
224551 AAATATACTT AAAGCATTTG AAAAGATAAA ATATTGCACC GAAGATATTG 
224601 TTTGCAAGAT ATGTTGCAAA TGGAATTTGC AACGTTTTAA TATTCTCAAA 
224651 ACTTTCTAAG ATTAGAAAGT TATTCAGGCA CCTCTAAGTG AAAGAATAAA 
224701 AGGCAAAAAC AGCAAGTCTT GCTAAAATGA GGTAATCAAG AAAATGGCAG 
224751 AGAAGCAAAT CCTTTTGGTT CATCAGGTCA TTTTTCATTT ATTTCAGCAA 
224801 CAAGCAAGTA GGATACTGAC ATGTCCTCTT ATAAGACAGT TAGATTAATT 
224851 AGGAAAAGAA AAGAGAATTG GCTGTGTTTG CACTCTGAGT TAGCAGAGTA 
224901 CCAGGTATTC AGTTACAGAA ACAGAGAGTT GAATTATCAG TGAATCCACT 
224951 GGCTTCATGG AACAAATGGT TTCTAAATAT TCAAATGTTT TGGTACCTGA 
225001 CAAATGCTTA AGCTAGT^TT TAAAAATCAA CTTCAATCAT TAATTTAAAA 
225051 AATACCTTAA ATGCTGCCTA TACAGAGGTG TACCCGGCTC TGGCGAGGGG 
225101 CATTGCACAA GTCAGACACA ACCTCAGTCC TCAGCCACTT ATCATTCAGT 
225151 GGACAGTGTT CAACAATACA CAGAAGAGTA AATAAATAAG CTGATATAAG 
225201 ACTGTGATAT TCATCGTGAA CACAATAATC CAAAGGAATG TGGTCAGCAT 
225251 TATCTGGTAA GGACAGCAGG ATTGCCTTAG CATAGGCAGC CAGGGAAGTT 
225301 AAAAACTGGG CTGAAGTGAC CCAGTTTTTA CTTCTGTTTC GTCTCTGTGA 
225351 CCTGAAGTGG CAGCCCCGTG AGAATCCCCA GGTGAAGGTA AGATGCAGAA 
225401 CGCCATGAGG TGGGGATGAA CCTGGACTCT TAATTAGCCC TGCATCTCAG 
225451 ATTATAAGAA TAATCTTCCA TTTTAATGGG TCTTTACAAC TGACATGGGC 
225501 TTTATAGATG CTGAGATTTC TGAGTTCCAT AGTTTGTCTT ACGTGTTCAT 
225551 TCTGCAAGAA GTAGTATTTC TAGAATCTCT ACTGTGCATA GACTGTCCAA 
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225601 CTACAAGTTT TCCTTCCCTG TGGTCTGCTC TCAAAATTAG AGATAAAAAA 
22 5651 TACAACTTTT ATTTTTGTGT AATCATCAGG TAATAGATGT CTATTGTGAA 
225701 CATGCTGGTG GAAAAATAAA TACCAAAAAG GAAATAATTA AAGCTTATAA 
225751 GAAAATAATA CTGAAATATG CCAAATAATA ACAAAATATA ATAAAAAATA 
225801 TAAAAAGAAA AAGAAACACC AGATGGATTT CCCATTCTTG AAATGAACCA 
225851 CTATTAACAG TTGTGTGATT TTCTTTGTTT CTAATCTTCA AAACAAACAT 
225901 ATACACCCAT ACAATTACAC ATATACATAT AATGAACAGT GCATTTTTTC 
225951 CCTTCAATTT CAGTTTAACA TATAAATCTA CTATCTCTCT ATCTATCTAT 
226001 CTATCTATCT ATCTATCTAT CTATCTATCT GTGTATCAAT CATCATTTAT 
226051 CTATTTATCT AACATCACAT CACTTCTCAA CCTTTTGGCT AAGAACAAGT 
22 6101 GTAGTATCTG TTCTTATCAG TTTAATATCT ATCTACCTAC CTACCTACCT 
226151 ACCTATCATC AGATATAAAA GCGGAGAAGG TAATGGCTTA CTTGGTGCCA 
226201 TTATAGGAAC AACAGATGTC TTGGACTGCT TCTTTGATTA GGCCTAATAT 
226251 ATTGACAATG TTACTTCTTT GTTTAGGTTA AGGCATTAAA AATTTATTCA 
226301 TAAAATAAAT CAATCTATTC ATAAATAGAA ATAGTCTTTT TAGAAGGCAT 
226351 CACTTTAATG CATTATAAGT CACCCTGTCC TCACCTAGAA TGATCTTTTT 
226401 ATATCAGTCA GCCGGCACAA CCCTGTCAAC ATACTTCTGA GCATGTGACA 
226451 CAGACATTGG ACACGAATAT TATGGGACTA GAACAACTTA GGAAGTGTGT 
226501 GAGCTATAAA GCCTTTATTT TTTTTCTTTC ACCTCCCCTT CCTATTTCCC 
226551 TAGAAAGATC TACCATTTCT TCCTTTTTGT CTCCACATAT GTTTCTTTGA 
226601 TGGCACATAT GTCTTATTAT ATTTATTTGA GTGGCAGAAG TAAGTTTCCT 
226651 ACGTAAGAGG CGGTATAGAA AACACCGAAT TTCTTAATTT TTAAAATCAA 
226701 TCTAGATCAA CCCTTGCTTG AGAGAGTATA GAACTAAATC AGTCTCCTTT 
226751 CTCCTTTCTG CTTCTCCCTG TCCCTGTGAT AACAGAAGCA CCTACTGCTT 
226801 ACCTGAGAAA GGTGGAAGGA CGTCTATCAT GCCAGCCACT CATGGACGTG 
226851 CCCGTTTCCT GAACCCTTTT GGGCTTCTCC AGGGCATATG GCGATTTCCA 
226901 AGCAGGCGAC TCCATTTTCT TACTGTCTAC TATGAACCTA GAACTCTTTC 
226951 ACCAGTGGGT TGTGGTCTCA TCATTTCCCA CCAAAGCTTT TCCTCCTCTG 
227001 GTGAGAAAGA CATTGGGTAT TATTCACAGA AACCCCCAAA CCTAAAACCA 
227051 TTGTTCTTTC CCATGGTCTT CAGGTATCTT TCCTCTTTTA AAACTCCAGC 
227101 TTCGGTTTTT CTCAATGGTT CTAGACTCCT AAAGAAAACT GTGGGTTAAA 
227151 AAAGATGAAC AATTGCCCTG CAACTTCTGC TTTTGTCCCA GATACCAAGT 
227201 TTTTGACAAT TTTCCAAGAC TATGAAAGTA GCAATATGAA GAAAATTATT 
227251 TCAACTTAAT TTCCAGTTGA GATGTGAGAA AACAGCAAGA AGTTAGCATC 
227.301 ACAATAAATA ATCTAGCACA TTTGTTTACA TGTCTATGCC ACTCTACAGC 
227351 ACTGCAACCC CACTAGCTCT GGGTCCCTCA AAAGAACATA GACCTTAGCA 
227401 AATAGCAATG TGTCCCTACA AATGTTCATT GAATAAATAA ACACATTCAT 
227451 TATTACTTAC AGCTTAAAAC CACAATAAAA CACATTTTGA AATTTCCTTG 
227501 TCTCTCAGTG ACCTACAATT TGAGGATGAA AGCCCTATAT CTGGCCAAGT 
227551 CAGTTTTTCC CACTTTCTAG TGAAAGTGGA GAGTAAACCA AACCTTTGTA 
227601 CATCAATATT TTGACTGTGT AGGTTCTGGG AAGAATAATA ATTTGAGAGC 
227651 CTGAAACAGT ACGGAATATC TGGGTAGCTG ATTATTTTCC TTGAGAAAGA 
227701 CCTTATAAGC AGGCATAATT TATATCAACT TTATCAAAGC TTCTCACATT 
227751 GAAGAATTAG TTGATGCGAG CTATGGCTGA GACCTCAAGG AGAGCTGAGA 
227801 CATCGTGGTG GGCTGGTGAG GAGGAAGCAG AGGCATCTGG AGTGGGAGGA' 
227851 ATTTTCAGTG CTTTGTGCTG * ACCCCTTGCA TAGGGAAGGC AATGGGTAGT 
227901 TAGCAATCAA ATAGGTGTTG AAATGTCTGA GAAAACTCAG GGTGGCTCTG 
227951 GGCTTACCTC AGTCGGTTTC CTAGGTAAGC TGATCCTGGG GGTGGGGACT 
228001 GGGGCAGG AA TTTCTAGTGG TGACAAGGAT CTTCTGAGTT ATTTTGAAtG 
228051 TTGCAGGAAG GTGATTTTAC AGTTAGGGCA TTTATCAAGT AAATTTTAAT 
228101 TCAGTATTCA AACTACTAAT GTTATAAAGT TCAAAACCTA GATTTAAAAA 
228151 GCCTTTAAGT CTGCATGTAA TTTCTTAACT TCTTTAAATA GATAATTAAT 
228201 TTGAGTGGTA TGTTTCCTTT TTGTTTTAGA ATTTCAGTGG GAGCCTTAAA 
228251 ATGGGGGAAT GATAGAAAAG TGCTGGAGTT GTCTATATGG GAGTTAATAG 
228301 AAAATATCAA AACTGGCCCT GGGACCCTCA TCCCCACTTC TTCATCAGTC 
228351 CATGTAAAAG AGGAAAATAA AGGCCGGGCG CGGTGGCTCA CGCCTGTCAT. 
228401 CCCAGCACTT TGGGAGGCCG AGGCAGGTGG ATCACGAGGT CAGGAGATGG 
228451 AGACCATCCT GGCTAACACG GTGAAACTCC GTCTCTACTA AAAATACAAA 
228501 AAAAAAAAAA AATTAGCCAG GCGTGGTGGC AGGTGCCTGT GGTCCCAGCT 
. 228551 GCTCGGGAGG CTGAGGCAGG AGAATGGAGT GAACCCGGGA GGCGGAGTTT 
228601 GCAGTGAGCA GAGATTGTGC CAGTGCACTC CAGCCTGGGC ATCACAGCGA 
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228651 GACTCTGAAA AAAAAAAAAA AGAAAAAGAA AAAGAGGAAA ATAAAGTATG 
228701 GGAAGACAGA ATGAAGGGAA TTAATAGTGG TCCAGAGCCC TTAGAATTTT 
228751 TTCCTTCCTC TACTTGCCCT ATCTAACTGA AGAGGGCTCA AAGAATTCAG 
228801 TCGTAGAGAC TGAAGATGCC TGTAAGGGGG TGGGGCATTG ATATCTTACC 
228851 CTCTGGCTGA AGGTTTGCTG AGATGCAGAT AGAGCCCACT GCATGGTGAG 
228901 AGAGCGCTCT CGGGAGAACC AGACATATGT ACCTAGTGGT GAAGAATTCT 
228951 GAAGCATGGA GGTGAGTTGT AGGGACCCAT GATGCCGGAG CAAGGAGAGA 
229001 TGACAGAAGT. CGGAAAAAGC CTATACTTCC ACAGTTTGAC AAGGCAGGAA 
229051 TGCATGCAAT TCCCTATGCA AAAGGAATGT TTAAGGCACA CACTTGGCTG 
229101 AAGCTAAGTT GAAAGACTTT CTGCACAAGA AGTCGGCCTG CACAGCAGTG 
229151 CCAGAATCCT GTGCTTAAGC TGAGGTTGAA ACTAATTTAC TTCACCACTT 
229201 CATTTTATTT AGGTACTCTT TATTGACTGT GTTCTACAAT TCCCCCTAAA 
229251 ATAACTATAC AATTCTCTAG TCATCTTCCT AACAAGAATC ACAAAGGAGA 
229301 CTGTAGTATA GTTTCAAATA CCCTTCAGTT TGAAGCAGAA TCTTAATCCT 
229351 GCAGCAATTA GAGAGGGATT AACCCACACA GGGAGGAACT GAGAAGAACT 
229401 AGGTCATGTA TCCTGGTGAA CCTCAGGGAA GAAGGCTGGG ATTCAGGGTA 
229451 GTAGAAGAGT ATATCTGAGG CTCTGGGATA AAGTGATACA AGACATTTCT 
229501 GGTTGGGCGT GGTGGCTGAC ACCTGTAATC CCAGCACTTT GGGAGGCCAA 
229551 GGCGGGTCGA TCACGAGGTC AAGAGATTGA GACCATCCTC GCCAACATGG 
229601 TGAAACCCTG TCTGTACTGA AAATACAAAA ATTAGCTGAG CATGGTGGTG 
229651 CGTGCCTGTA GTCCCAGCTA CTCGTGAGGC TGAGGCAGGA GAATCGCTTG 
229701 AACCCAGGAG GTGGAGGTTG CGGTAAGCCG AGATCATGCC ACTGCACTCC 
229751 AGCCTGGTGA CAGAGCAAGA CTCCATCTCA AAAAAATAAA AAATAATAAA 
229801 GAAATTTCTT ACATAGGAAG GTTGTGGTCT TCAAGCCTTA TCTTTCTCCT 
229851 CTTCCTCCTC CTCCTCCTCT TTCTCCTCCT CCTCCTCTTC CTCCTCCTCC 
229901 TTCTTCTTCT TCTCCTCCTC TTCCTCTTCC TCCTTCTCCT CCTCCTCCTC 
229951 TTCCTCCCCC TTTTTTTGTT GCCCAGCCTG ATTCTGATTC CAACATTTTA 
230001 CTCTTAAAAT AATCTTGTCT AATTGAAGAT TTTTTTTGGA GAGATTGGTA 
230051 TAACAATGAT TTTATTAGCT TTTCTTTGTC AGCAGATTTA CATTGCTSCT 
230101 AAAGAGCTGT ATCAACTATT CCTTAGTTTA ATGATGAAAA TATGATTGCT 
23 0151 GGAATCAGTC AACACCTCCG TGGGTGTAAC CTGAACATAA TCAAAACATT 
230201 GCTGGGTTCT GACTGAATGA AAGCAAGGTG TGGTCCAATT GCTCTGAATA 
230251 TCATATCAGT GTTGAAAATC ACAGGTGTGG CCTGATATGA CAACTCGTGA 
230301 CATAAAATGA TTTTACCTAA TAGCACATAA ATTTTAAAGA TCTGATAATA 
230351 GTCACCTCAA CAGGAAAAAG CTAAAAATAT GTGTGTGTAT GTATGTGTAT 
230401 TAGAAATATA TTAGATAAAA ATATGTGTGT GTATGTATGT GTATTAGATA 
230451 ACTTATGCAA ACCACAATTA TATGAATAAA ATATGGATAG TTTCAAATCA 
230501 ATAGAGTGTT CTGCGGTCTT ATGTAGAATT TGGGAGTAGG GAAAGGGAAT 
230551 GTTGTTCAAA GTCTATCATG ACCCTATGCA ATGGTCCTTT AAAGTAGACC 
230601 TGTTTGTAAG ACACTAAGCA TGTTTACACT CAGGGGCTCT GGGTTGTCAA 
230651 AAAGTTGGTA GAAATAAGGT TGAGGAATAA GGTTACTTAC TGTTGATAGG 
230701 TACATAGGAA CACTTTATAG ACTTAAGGAA ACAAGACCAG TTCCACACTC 
230751 TATAAAATGG CTAGGCTTGG TCTTTTTACA TATTGCCAAG CAAAGCTTGG 
230801 ATTGCGTTAG TGTCTGGCTA CAGCTGGACC ATGCCACGGA ATTCCTCTGT 
230851 GATGGCAGCC AACCACAACT GGGAAAAGAG AGGAACTATT AGACATAAAA 
230901 AAAGAACTGG AACTAGCACA TAGAATTAGT GAGAGAGCAA GTGACATAGC 
230951 AGATAGAGTT ATAAAAGCAA TAGAATTATG TGACTCTAGT TTGAATTCAC 
231001 TGTTTCCTTT ATCACTGTAA GAGGATTGGT GCCCTTTCCC AAGATAAAAG 
231051 AGGCAAGTGA AAAGTTACTC AACTGGAGAC AATAACTAGG ATTTTGTACA 
231101 CTTGAAAGGC TTTTAAGATT TTCTTGGATA CATATTTATG GCAGAAAATA 
231151 TTTGTTTTGC TCAAAGTTTT CTAAGCTTTT GCAAACCTGA AGTCATTGTA 
231201 TTAATCAAGA ATGTTCAACT GCAAAATATG GAGCACATGG TTTGTCCTAG 
231251 ATTAAAAACC TAAAGCTGCA TATGCCACCC ACACATAAGA AGAAGGCTGG 
231301 CAGTAACAGT CCAGGGCAGT GGGTTTAGCA ATGACTCCAG GGACCCACGT 
231351 TCCTCCCATG TTTCTATGCC TCATTCTTAG TGTCTTGGCA TTTATGCTTA 
231401 TGTATGCTGC TCTATGGGCA CTGGAAGTTG CCACATCTCT AGGCTTTGTG 
231451 ACTGCATTCC AGACAGAACA GGTTGCTTCA GCAGATTTCT GCTTATATCC 
231501 CAGACCTATG TTATATGCCC ACACCTAGCT GAAAAGGGAA CTAAATAAAC 
231551 CTGTTTCCAG CTTTTAAGTC CTTGAGAACA GTCAGGAAAG GAGAAGCGTA 
231601 TTGAAAGTGC AAATTGAGGC TGGGCGCGGT GGCTCAGCCT GTAATCCCAG 
231651 CACTTTGGGA GGCCGAAGCA GGCGGATCAC GAGGTCAGGA GATCGAGACC 



WO 02/086113 



77/219 




PCT/GBO2/01887 



231701 ATCCTGGCTA ACACGATGAA ACCCCGTCTC TACTAAAAAC AGATACAAAA 
231751 ATTAGCTGGG CGTGGTGGCA GGCATCTGTA GTCCCAGCTA CTCCAGAGGC 
231801 TGAAGCAGGA GAATGGCGTG AACCTGGGAG GCGGAGCTTG CAGTGAGCCG 
231851 AGATAGTGCC ATTGCACGCC AGCCTGGGAG ACAGAGCGAG ACTCCATCTC 
231901 AAAAAAAAAA AAAAAAAGAA AAAGAAAAAA AAAGAAAGTG CAAGTTACAT 
231951 CACCCAGCCT GTAGTGTTGG CCCACAGCCA TTCCAAGGAT GAATATTATA 
232001 TAAGCAATTA TTAGATCACA CAATATGTAC TGAATTATTA ACATTTCTCA 
232051 TCTCCTAAAT ATAGTAATAT GAGAGCTGGT GGTTTTAAAT CTTTCTTGCT 
232101 TTTAGCCTAA TTTATCTTAG CTTCAGCCCA GTTTCCCCTA GACCTGGGTC 
232151 TGTGGTTCTG TAAAGAAATG GCAAGCTGAG TTATACTCAG GCATAGCAAT 
232201 TAAATTCTAT TGTGCATGCC AGCTCCAGTC ACTGGAAAGG GCTGGCCAGG 
232251 CTGCCACGTA GAAATGAATT CTGAAGTTGC CTAGAGGCCT GGCAGGAAAG 
232301 TGGGGCCTTG ATGTTTTGCC ATGAGAGAAA ATGGAGACAA TGTTTTCCTG 
232351 TCGGCTGAGT GTGTGTTCTC CCTTTTCATA CAAAATTCAG TTATACTGTA 
232401 GACATTTCCC TGGAAAGTGT ACTAGTTTTC CTCTTTCCTC CTGTAATTCT 
232451 TTTAACACCA ATATTTTTGA AGAAAGGGAA ATGGATGCTT TTGCTTAGTA 
232501 AACTGAGTTC AACTGGGAAG CCCTTACGTT GTAGAAAGGA AGGCCAGGTT 
232551 GACAGTCAGC TACACTTGTT TCTTTGAGTT TAAGAAATTT ACCTGGCGGT 
232601 ATCACCATAA ATTCTATCTT CTAAGTGATG TTCAAAGAAG TGACTGTAGC 
232651 CACATTAATT TTGTTGCAGT TTGTGCTATG ATTCTTATCA CCAATTATCA 
232701 GTTGGAGCAA AGCTTTATGC CCAAATTAGC AACCTTAAAT TTATCTACCC 
232751 AAAGCTATTT TTGTTCAAAA TTACTTACTC ATATAAATGA AGTCATATTC 
232801 CTTGTATTAG TCTGCTCGGC TGTCGTAACA AGATACCACA GACTGTGTGG 
232851 CTAAAACAAC AGAGATTTGT TTTCTCACAG TTCTGAAAGC TGGAAAGTCC 
232901 AAGGTCAAGG TGCCAGCAAG ATAGTTTTCA TTCTGAGGCT GCTTCTCCTG 
232951 GCTTGTAGGT AGCCGCCATC TTGCTGGGTG CTCACATGAC CTCTTCTTTG 
233001 TGTGTGGTGA AAGAGAGAAA AAGCTCTCCA TTGTCTCTTG TTAGAAGTCT 
233051 CTTCTTCCTA TTGTATCAAG ACCCCACTTT TATGACCTCA TCTAACCTTG 
233101 ATTACCTCGT TATAGGACCT ATGTCCAAAT ACAGATTAGG ACTTCAATAT 
233151 ATAAATTTGG GGAAACATAC ATCAGTTCAT AGCATCTCTA TAAAATAAAA 
233201 TTATAGGGAA GTTATAATTC TAAAACAGAA GATATATTTT TTCATTTTAG 
233251 ATTCAATAAA ATATCTTTTA TTCTAACCTG TATCGTAAGT CATGATATCC 
233301 TCTGTCCACA GAAAGTTCAA GGTCTTTAGA AAGATTTGTG AGGAAGCACG 
233351 AAAAGCAATC CAAATCCGTT TATTTTATCA GAGTCTTATT TTCAGCCAGA 
233401 ATTATTTAGC TTGAGAGGTC ACCAAGAATA ATAAATACAA GTAGTCCAAA 
233451 GGTTAAGTTA TTTTCTTCCA CACAAACTTT AATCAGTTTT ATGATTTTTA 
233501 ATTCTTGTTT TAATTAGAAA TCAGAGGATA TTGAAAATCC CCTCTAATGA 
23.3551 GCTTTTATTT TATTTTTTTA TTTTATGGGA GATGATTTAA GTAATAAGGC 
233601 TTGCCAAGTG AGTATCAGTT TTTGATCAAG GAAATAGTTC TGAGCTATTC 
233651 AAAAGAGTAT TTAAAACTGA GGACGGAAAC AAGCCTCAAA TTGGTATATT 
233701 TATTGTTGTG ATTTAGATAA AATAATTGGA CTAGAAGTTT AGAAAAAAAC 
233751 AACAAAAAAA CAGTTTCACT TCTGATTGTC TCTCTTCTCC TCGAATAAGT 
233801 CCCAAGAGAA GAATGCCTCA TACAAAACTG ACCAGTACCT AGGACTGGAG 
233851 GGAAGAATAT CTGGGTTTGA CTATACCACC AATACTCGCC AGGTGTGTGA 
233901 CTTCGGACAG AAGATTTAAC TCTTTTGAGC CTCAACTTTC TTATTCATAA 
233951 AGTGGGACTA TGAATTTCTG TCTAAAACAT ACGAGTATAT ATATATTTAG 
234001 AATTGTGCCA GTCATATTAC TGGTATGCAG TAAATTTGAT AAGAAGTCCT 
234051 TTGTGGATAG ATATGTATAA AACTCATAGT TGTTTAACTC CAACACACTT 
234101 TTAATTTTTG GTTAAATATA CAGGCACTCA AAGAATATTT GCTGAGTCAA 
234151 TGAGACAATC TATTATAAAC AAGTGACTTA TCATGTCATT TCCTTTACTC 
234201 CAATATTCAG AGTTTGCCCC AATGGATCTT CCAGCCTTGG TAGTTTATGA 
234251 CATTTTGATA CAGAATAGAA TTGAAATCTA CAATTTTTAG GACTGGCATC 
234301 AAGGAGCAGC AAATAAAAAT CTCATGTTAG TCAAGGCAAA GTTTGTGTGA 
234351 ATGAATAAGC CATTAACACT CCAGAAGTGT GAGCAACAGT TTTCTCTTTT 
234401 TGGATTTCTT TTGTCCA7iAT TACAAAAGTA GTGCATATTT GTTGGAGTAA 
234451 AATTTAAACA AATTAAATCC TCTATAGAAT TCTAAAAAAT AGAGGAAAGC 
234501 CTGGCTTTCT TTCATTCTTA TACTCCTTCT TATCCTACTT CCTAGGATTA 
234551 AATGATGATA AAAGTTTGAT ACATATTTTT ATCACAGCTC TTTCTCTCCG 
234601 TGTCTCTCTC TCATATATTC ATGCAAAGAA ATTGACAGGT AGGGTlTTTC 
234651 TTTTAATGAA TTATATTGGA CAGTAAGTAG ATTTATAATT TATAAAGAAG 
234701 GGAACACAGA GAGGCCTCCC AAAGAACGAC TCCACATTGT TTAATTTGCT 
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234751 GTAGTGTGAA ACCTGAAGGG TGTCTGGGAC TGGGCCAGAT GAATCAGTTT 
234801 TTGCCCAGAA TGGCTCTTCC TGGTTCACCG CAATTATATC ACACAAATGA 
234851 CCACCTTCTC CACATCTATC CTGGCTGGCT TGATATTAAT CAGTCTAGTT 
234901 GAGGTCTGTT CCTTTTTACT AAACCCAAAT CTGACATTAA ATATTGAAAC 
234951 TGTTCAATAC ATCAATAGGA CAATCGGAGA GGGCTGTTCT TTTTACTTTG 
235001 CAGTTCCCAG TGACACAGAA CAGCTGAAAA TCTGATCACA ACATGCCGTT 
235051 TTGAGTTCAC TTAATGTTCC CTATCTCCTA GGCTCCATAA ATAAAGAATC 
235101 AGTGTTTGGA GAGTGATTTC AGGGTCTCGT CCATCTTCAG TGTCTTTAAG 
235151 GATGGAAAAC CTGAATTGCA GGGTCGGCAA CATTCCGCAG TCTCCGGCCT 
235201 GAGAGCCAAA AAGAGTGAAG AGATTCTCCC TAAATCTCAG GCCTCTCTCT 
235251 GGCTGGGTAT TGGGCACAGA TTTTAATCAC TGGGTGTCGT CGACCCTTGA 
235301 GTGCCATAAC CTCTGTGCGG ATGAAGACAT TACTTTTGGA GGGGTAACAG 
235351 CATATATCTG CCCACAGTCT TAGGTCTATT TTTTGGTGAC TCTTGTATTT 
235401 GAGAAGCTCC TGCTATGTCA GAATCATAAT AATTTTTCAA GGACATTTTA 
235451 TGTTTTCTAA GGTTTGAGTC CTCATCAACC AGGATTAGAA GAGACACGTC 
235501 AACTTTCACT CCCTTGCTGT TTTCCAGCAC ATTTAACATA ATGCTTTAGA 
235551 AAGAGTTTTC ACAAACAGTA AACATCTGTG TATAATGGGG TCGACATTAT 
235601 TCCTGACTTT TCAACAATAG CCTAGTCAAA ATCCTTTATC ATCCTATGTT 
235651 GTTCAAATTA CTAGCACQAA TTTCTTCTTT CACTCCTCCA TGTTTTTCCT 
235701 TTTTGGGTTT CCATTGATTT GATGACTTTT CTTAGCTTTT TGTTAGTTGC 
235751 TTGCTCCAGT CTTTTTAACC TCTTCTAGAG ACTGCTCTAG AATTTATCCA 
235801 GAAAACCCTC TTTGCTCTTC TTTAGTATAA ACACCACTCT TACCTTCAGC 
235851 TTTTACAATT ACAACAGAAT AGCTGGACAT CACTACAGAA TTTTTTATTT 
235901 TTTTTTGGTA GATTTTCTCA TCTGGAGTCC TATTTGAAAA CGTACTTTTT 
235951 GTATGGTCCT TACTTTAGAT CCATTGCTTA AGAAGTTTTG GATAAGTGAT 
236001 TTCCAAGATG ATTGTAACAT TTAGAGAGAG TTGGCCACAA GTGTGCCGTG 
236051 CTTTTATTTC TTAATTTAAC AATTATTTAT GAAGACTTTA CAATGTGTTA 
23 6101 TCAAGCCCTG TCTGAATCCA GTTTTTTTCA GTGAAATTGC CAACTACATG 
236151 TTTGGTCCAA ATTAGAGACC TCTGATGTGA TCTCTCAGGT TGCCAGCTGA 
23 6201 AGGCATGTGT ACAGGCTTTC TCTGCCGCCA TGGTCTGTGC CTTGGCAACT 
23 6251 GCTCTCATCG GGCAAATGCC AGGGCTTTAT GTTCTGCCTT TTATACTTTC 
23 6301 ACCATCCACT CTGTAGCTGT ACTTCATCAT TATTGGAGTA GGAATTATAT 
236351 ACATTTGCAC ATTTTTCCTG AAACATGGTC ACTAGGTTGC AAAGAATAAA 
23 6401 AACCCATTCA AATAAATAAA GCAAAAATTG ACGAATCAAT GGGAATCTTC 
23 6451 TGAAGCTAGG CCTCACAGGG .CCTGTAACCA GGAACTGGAG GACTGGGGCC 
236501 AGTCCCACCC CTCAGTGTTT CTCTGCTGAT CTTCTTGTCT CTCCTTCAGC 
23 6551 TGCAATTCAC ATTGTCTTGA CTTTGGCAAA TGTCGGGCCC AGTTGTGATT 
23 6601 GTTTTAAACC CTACCCAGCT ACTCAGTTTT ACCAATTTTA AGTTTTCAGT 
236651 AGAGACTGAC TAGTTTTCCT TTTTTCCAAA TATAACTTTC AGAAAGAGAA 
23 6701 GTTTGTTTTG TTTGTATCTC TATAACTAAT TCCATTAGCG GAAACTGGGG 
236751 GTACTGGGCT CATGAGGTCT GTTGATTTCT TAATAGAGGC TGTTGGAACA 
236801 AGTTTTCTGA GAAGGAGTTA AGAAAGTTGG AAGGACAAGA TTGCTGCCTC 
236851 TAGTACAATT CTATACACAC GCACACACCC CACATATGTA CATATATATT 
23 6901 ATCTGTTCAT AAAAAAGATA TGCAGATTAC AAAATGGAGG TTGAGAAGCC 
236951 AGCATAATAC AGAAGAAAAA GCCTTGGAGT GGGAGGAAAG AGTCATGAAC 
237001 GCTTGAACTA GCTTTGTGAG TAATTGCTGG TGTCACTTTC TCATTTCTGA 
237051 ATTTCATTGT CCTAATTAAT AAGTGAGGAA TGTACTAGTA GTTGGTTATG 
237101 CACCTTAACA CCCTTTTAAA TAAACGATTT TTAAAAAAAT GGCTCTTATA 
237151 CTACATTTCT TCTTTTTTCC TATGTTAGTG GTTCTCAGAC TTTTTGGTCT 
237201 CAGGGACTCA TATCCTTAAA AAAACAAATA AAAGATTGAG GAAATCCAAG 
237251 AGCTTTTATT TATATGGGTC ATGTTTATTA ATGTTGACTG TATTAAAAAT 
237301 TAACACTAAC GTTTAGAAAA AAGTGTTTAT TAATTAATTT GAAAATCACA 
237351 ATAATATGCT AATTGTGTTT TAACGTAAGT AACATTTTTA TGAAAAGTAA 
237401 CTTATTTCCA AAACAAAACA CAGTCTCTTC AATGTCTGCA TTGAAAGAAA 
237451 AACAGATGGA TTTTCATAAC TATTTCTGCA TTCAATCTTT TACAGCTTGT 
237501 TGTTTTCACT GAAGTGCTTG AAGAAAATGG ACCTTATGCA GAATTGTAGT 
237551 TGGAAAAGGC AGAAATATTT TGTGGCCTTT TTAGATAATT GTGGTTCTTT 
237601 TTCTTTGGTA GCACACCACA ACTCTATTAA AGGTAGTTTC TTAAAGGTTA 
237651 GTGGCAATAT AGGATCTGAA ACCATATCTA TTAACTTTTT AC TATGAAAA 
237701 GTTTATTAAT CTATTTTGTA CTCTTTTACC CATGCACATT TCTTTAATGT 
237751 CATCCATTGG TCATTTGGAA GATGCTGACT TACCTACTAA TTTAGATCTT 
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237801 CCAAATATCA ACAATTTTCA CTATACAACG TTGAGAAATC ATGCTTGTTA 
237851 ATACAGTTAA TACAACCACA CATGTCATCA GAAAAGTCTA AGTATTGGAA 
237901 AGAAGTTAAG CTCCCATGAC AGTTAGAAAT TTCCTGAATT CTAATTTTGG 
237951 CTTGAAAGTT CCTATTTTAA CACTGGCCAC AGCTATTGTC AGTTGTTTTC 
238001 CTTAAAGTGT TAGGCTCACT TTGTTCATTT TAGAGAATGT ATCAGGCAAA 
238051 TATTCAAGTG TGAATAATCA CAGTGTGCCT GTCAGTTGTT CTTTCACATA 
238101 AAAGTGTCAT TTCATTAAAA AGCAGCTCAT CAACTTACAA CTTCACTGTA 
238151 CAACCATTGT ACTTTGATTA GTAGTAGAAG TGTTTTATGC ATTGTTTCAC 
238201 CAAGAAAAGG CTTTTGACGA GTTCGTCTAC AGAGGCATTA TGAGAAAGGC 
238251 TATGTGGGAG GTGAATATTT TGTGATTTGC ATGTCTGAAG TTAATTTTAT 
238301 TCTATCTTCT TATTTGATTG ATAGTTTTGC TAGTTACAGA ATTCTGTTAG 
238351 AAATCATTTT CCCTTAGTGT GTTGTTCCAT CCAGCCTGCT AGTTTCCAGA 
238401 ATTGCTATTG AGAAACCCAT GGCTCCTCTG ATTTCTTATT GTTCATACAT 
238451 AACACTTTTA TTTTTCCTCC ATGAAAGTTC TTAGCATCTT TGTCCCCAGT 
238501 GCTCTAAAAT TTCAGAATGG TGTTCCTCAT TCAGGGTCTA GTTTTACTTT 
238551 .TTGTGTTGGA CATTCAGTGG GCATTTTCAA TCTAAAAACT CTGGCCCTTC 
238601 TATTTCTAGA AAATTTAAGT CATTATTTGG TTGATGATTT CTTCCCTCCT 
238651 GTTCTTTGTT TCCCTTTCTT TCTAGTACTA ATATTTAGGT ATCAGACCAC 
238701 TGCAAACAAG TCTTCTAATT TTGTTATCTT TTCCATCTCT TTAAAGACGC 
238751 TCTTTTTATA TGGGTTTCCT CAACTTTACG TTATACTGAT CCAAAAAATT 
238801 ClVSATATCAC ATTTTTAATT ACTAGTGTTT TATTTGAATT CTCTTTTATT 
238851 TTTTAAAACA ACATTCTAGC CTTGTTTCAT GGATATAATA GCCACTGTTA 
238901 CTACTCTTAA TAATAGTTTC TTTCATTTTT TTCTCCCTTC ATAGTCCTTG 
238951 CTTCCTCCAA GGACTTTTTT CTATTTATTT GACTTGGTCT CTATATTAGA 
239001 GACCTTCCTC CAAGGACCTT TTTCTATTTG TTTTACTTGG TCTCTATATT 
239051 AAAGATATAG AATTTCCTCG TGTCTAATAA CCCTCAGTTT TCTGCCCATG 
239101 ATTTAGAATA GAAATGTAAA ATGTAATCGA AACCTCTGCG CACATGAATA 
239151 GGGTTTGCAG ACTTTGGTCT CCTTGGCAGG GTGATGAGGG TGAGTTGTTA 
239201 GAGTTCTTTG ATGTTAATGT CTTTGGGCTT TTCCTTGGAG CAGACTCTTC 
239251 CAATCTTTTG TATTGAAAAT ACAGGTCTGC CTACCAGAGT TTTGGAAATA 
239301 AAAAACAGGA AAGGGGTAGG TTGGTCAGCA TTCCATAGTC ACTTGGCCAC 
239351 ATCATCTCAC TGTTCTTAAA AGACTGTCTA GGTTTTCACA TGAGTCTGGA 
239401 GTCCTTTTGT GCTGATACCC TCTATTTAGA GTCTCCAGAA TTGAACCTTC 
239451 CAGAAATCTT TTGAGAAGTC TCCAGCATTT TGCCAGAGTG GAGGATGGGC 
239501 AGTTGTCTAT GGGCATAGAG TGGGGAGGGG TTCCAGGGAT CTATTTTTTT 
239551 TCTCAAATAA TGTTAAACCA ATTATCATCG TTCTCTTCAT CTAAATTACG 
239601 CTTTGCTCCC AGTTCCAGAG ATATCTGGTG CTGTCACTTC CTGTACCTTT 
239651 TGATGATTCT GCAGTGCAAA TATGGTTGTT TCTCAATTTT CTATAGTACC 
239701 AGCTTAAGCT TGGACTGATT AATTATTTAA TTAAGTCTGC TGAATTATTC 
239751 AATACTCATC TTTCAGTCTT CCAAAAATTT GTTGCTGTTA AATCTTCTCT 
239801 TTTTATTTCC ATCCTTATAA GTTTATGTCT TCCAAAAAAT ATTTACTTAT 
239851 CTAGTGGGTG TTTTTAGGTA CTGCCAAATT AGATGCATGT ATCCAATTGG 
239901 CTATCTTTTC CTGGTTTTGA CCTTTCTTAC ATTTAAAAGG AGAACTAAAA 
239951 TGACTTAAGC ACCTAAGTGT TGGACATTTC AGGTCACCAG CAAGAGCAGG 
240001 ACAAAAACTT GAAACTTTGG CGATCTTTAA CCATGGAATT CAGGATGCTG 
240051 GTTCTCTACC AAGTGGGCTT TATACTCTTC TCAAACAATT ATTTAACTCC 
240101 ACTTCTGTGA CCTTCTTTAG CTAGATCCTT ATTTCTCTAG AAAATCGTTT 
240151 CATTTCTCTT TGGTTCATTT GTGCTCAATT CTTCTCAGTA ACAGTTTTAA 
240201 TCTTCCACTA CTCTCTTCTA GCCCAATCCC ATGTCTCATT CGTTCTCAGA 
240251 AAATTATGCA AACTCTTACT TAATATGGAA TCTGTTCATG TACCTCTAGT 
240301 TCTCACCTGG ATATCTAGAC AGATCTGCAG CTTGTAGCTG GCCTCACATG 
240351 AACTTCCTTA TTTCCAGTCT CAGAGGAAAT AAAAAGCTTC. TTCCTCATAG 
240401 AGACTGATCT TACCACCAGA AGTCTGAACC CCATCTCCTC AGATTTCTCC 
240451 TGAAAATTAC TGCATTATTA TTTCCACCTT ATTCCTATTT CCACTTTTCT 
240501 ACCTCAAAAA CTCCCATGTC TCACTTTTGA TGACCCAGCT CATGCTTCAT 
240551 GTCCTTTCTG AAGCTTCTTT GACCGTGTAC ACTCAGAGGT GGTTGCTTCA 
240601 TACATGGCAC CACCATTTGA CCTCATTGCC CCCATACTTG AATGTGAACT 
240651 CAGTGAAGAA AAAGATCGTT ATTGCTCACT TTTGTACTGC TAATGCCTAA 
240701 CATGTTTCTT AAAACATAGT AGGCATGCAC ATTTTTCTCT CAATCAACAA 
240751 ATGAATGCTT TAACTCTGGT GCTAAGAGCT GGAAACTTTT TGAGTTCAAG 
240801 CAGTATTTTA ATCAATATTC TTTAGTGAGA AAACATTATT AGAATATTCT 
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240851 AATTCTAAAT CATTGTTTAA CTTAGGAGCT CAAAAAGAAA AAATACTAAT 
240901 GGTGAGCTAT CTTGCCACCT AAATACAATG ACCTTTTATA TTGTAAAAGT 
240951 GAGAATACCG TTTTCCGCAA GTATAATTTT TGCAAAATTC AGAATAAAAT 
241001 GCAGCTTTTT GGTGTTGTTT CCCCCCTGTT GCTGTTACTC CGGACAGTAT 
241051 GTATTTTGCT TAAATTTTCT AATCAGGTCA GTAGTACATC TGCTGTTTCA 
241101 TTCCCATGCT GACTCATATC TGATAGATTT GTTTAGGGGA AGAATACAGA 
241151 AAAATTTAGT AATATTTTTA GAGGAAAAAA ATAGTATTTT CTATAGAATT 
241201 TATGATTTTA TAGTATCATA ACTTTTATTT ATCTGGGGTT ATTACCTTTC 
241251 CCGAGCATAA ATTTGCCTCC TCACAGCCAA GTTTCTTAAC AGTGGTTTAT 
241301 ATTCCATGTG CCATTTTTTG CCTTTTATTT AGCTTTTGAT CCATTGAAAT 
241351 CTAGTATTTC TTTTACCTTT CCACAAAACT GCTCTTAGCA AGGCTAAATG 
241401 TGACCTAGTT ACTGCTAAAC TCAATGAATA GATTTTGGGT TTAATTAATG 
241451 AGTTAAGTAA TATAACCTGC AGTTTGTTGT TAATCTATAT GCATGTGCTG 
241501 AAGTGTTTCA TGTTATATAA AGAATTTTGA ACCGTAATCT TATCCCTCAG 
241551 AGAACTTATG GACAAGTTAA TACACGTAAG AACAATACAG AACTAAGTTA 
241601 CTAAAATCAG AATGTGATTA GTACCGTAAA AGAGGCAAAG ATAAAATGTG 
241651 AAAAAATCAG GAAGAAAATC TGAGGAAAGA GGCAACATTT AGATGGGCAT 
241701 TGAATAATAT GTAGTAATTG GGGATGGCAG GAAGGGAATA AAATGAGCAC 
241751 TTTCCTAACC TTATTTTGTT TTCCAGGAGG TCAAAGTCAA TTCTTTTAGC 
241801 TTTGATTTTA AAGCCTTCTT TGAATGAGGC AATGAAGAGG CTGGAAGAGA 
241851 TATTTTGAAT TTGGGTACTT TACTTGCCTT TTCCAAAGAC TACAAACATT 
241901 TAATTTGCTT TAAGACTGCT CCTTGAATTT TTATTCCAGA AGACTTGCCT 
241951 TATGACCATA ATTTTGTAAG ATAGAACAAA ATTTCCAAAC CAGCACTTTG 
242001 TTCTCTTATA TAATGTACAA AATGCTGAAG TTACAAACCA CACCTGCCAT 
242051 GGCACCTCCT TAATGCTGCC AAGGTAGCAG TCAGCCTCTG AGTTGGGCAC 
242101 TATCTGCAGT TGAAGACTGT TTACCACAGG GAAAGGAAGG GAACTGAGTC 
242151 CTGGCAAACA CTGGTTCTGC ATTCCACTGC TTTGGAGATG TGGGATGCAC 
242201 AGTAGAATCC AGGACTGAAT GACTGCCAAA GCTCCCTAAA AATTATAATG 
242251 ATTGTTTTAT TGGGCTCATC AATTATCTAC CTTACATGTG AACTCCTTTT 
242301 CAGGTCATAA ATGTTGCGAC AACACTCAGA ACAGGAAGAA ATGAAAGCAG 
242351 GAGAAATGGG CAATCAGGGA GCAATGGTCA AAATTCCTAA TAGGCAGATC 
242401 TTGGGACCTT AGAATAGTAT AGGCTGGCTT GGGGGACTGC TGTCACTAAT 
242451 AGCTGAACTA GATGAAGTAG TCTGTCCCTG ATGTCTGGGG AACTATAGTG 
242501 CAATGGATTT TATGTGTGCA TTCTGTCTTC ACAGTGATTT ATGACTTTGG 
242551 CCATAGAGAA AATTTATTGA AGGCATTTGT CAGGGATTGT GGAGCAGATT 
242601 GTTATGCTTA CATAAAGAGT TACCTCCAGT GTTTATTTCT GTGTGTTCAT 
242651 GAAAGCATCT GTATCTAGAA CTGAACATTT CTAAGGCACT TTTGGGGGTC 
242701 TTGTTTGCTG AGTTATTTGC AACTGGAGAA CTGAGGGTGA GGAAAGCTCC 
242751 AAAAATCCCT TATGAGGAAA TCATGGCTAT CTATTTTAGT GTTTTATAAA 
242801 TTTAGGATTT GGATCACATT TTCAGCTGGA AGTTTTTGCA GAAGAGAATT 
242851 ATTCCAGAAA ACATAAATGT CACCATTAGA ACTGTGTAAG ACAATGACAG 
242901 TAGCAGAGAT AATGTTTTGC TTTTTTATGG ATGTGGAACT CGACTTGGGG 
242951 CAACTGATAA TTTTTGAGAT GTTTGTGTTT GGTTCTGGTT AGCAAGCAGC 
243001 AGGGAGTGGA ACCTGTCAGG AATGTCAGTC AACTGGATCT GTGACAGTCG 
243051 GTCACCCTGA GGTTGGTCTG CATCATACAG ACTATATATC CAGTGTACTT 
243101 GGAAGCAGGC AGCCTCCAGG ATAAGAGCAA ATTATAGTCA AGGATCAGCT 
243151 AAGAGGACAG TTTGAAAAGT TTGTGGTGAT GATCAAGAGA TGAATAGTGG 
243201 AAAAATCAGG AAGCCAAATC CAGTAACGGG AGATGCAAAG AGGGGCAATT 
243251 CACACTTGTC AATGAGTGTA GGTCAGTTCT TATTGGATAC GGGAATGGCA 
243301 GCCAGCTCTA AAGCTGGACT GGCATAGGCT AGTCAATAAC TTCACTTATT 
243351 TGTTGATTCT AATCCTGCTC TCACGTTCTA TGTGGTTTTA GCAATACCTC 
243401 ATGAGAGAAA GAGTACAATA AACTACTTCA TTAATTAATT TGGACGATAA 
243451 ATTGAAAATG TAGAGCGAGC ACTTCCCACT TTCTATGTGT TTGTGTGTTT 
243501 GGTTGGAATA TTATGGGACA TAAGTCTGGA GAGGTGGCTT TCAGACAGTG 
243551 AATCACACAG TAGTTGATCA TCTAATGTAA AAGCGAGTTT AGAAATTTGT 
v 243601 GTTCTTTTTC TCATTAAACT TACATTTTAT TGCTACTTGA GAATTCATAA 
243651 TACTATATTT GCTTTTTTTT CAGAAAGTCC TTCAAGTGAT TACTACTGAA 
243701 AGAATAAGAA AATGTGAGTT TTACTTTTTG TTGTATTAGT ACAGTTACAA 
243751 CGGTTAGTAG GATAAAATAA CTTTTTTTAT TCGTATAGCA GAATAGTTAA 
243801 AAGTATGCAC TCTTATGTAG TCAGGCCTGC CTGCTCTGCC TCCTACCAAT 
243851 GGCTCTGGCC TTTGCAGAGC TATATGTGCC CAGCAGGGGA AATCCTCATA 
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243901 GCTCACATTC AACTCTGGTA TTTCCCCATA TGCAAGTGTT ATTATGGGTC 
243951 TGTATCCTAC TCCTGAATAA CTCAGGAAAA GATATTAGGA AAGCCCTGTT 
244001 GATAACAAGA CACATAGGAT AAAGAAGTTC TGTCTTATAT TTACCTGAAG 
244051 TAGCTTTGAA ATTAACTTGT CACAGTCTAT GGACCATTAA ATATAAGCAT 
244101 GTTGGCATCT TAAGGTTGGA TGGCAGAGAT AGGCATATAT CTCTGCATAT 
244151 GAAAAATAAG CAGCAGTAGG GTTATGGCTA TCATTAGGGT TACTCAGAAT 
244201 GCTATTCTGA TTTTACTGCT ACTTGGGTGC ATGCTGCACG TCACTTTCTT 
244251 GTTACCTAAA CTTGAATGTA GCCATGAGAG TGGCTTTAGC CAATGGAATG 
244301 TAAGTAGAAG TGTTTTGTGT CTTTGCTGGG CATAAGCCTT TAAATAAGAG 
244351 CATGATTTTT CATGTTTCTT TCCTTCTGCC TTGTTGATTG TGAAAGCTTG 
244401 TATTAAGATG GGGCCTTTAT CAGCCTGGAT CCCTAAGACT GAAAGAGCCT 
244451 CACGTGCTCT TCCTCCCGTG CCAACCAGGT ATTGGCATGC TGTATAATGA 
244501 GAAAGAAACC TTCATTATTT TAAACCACTG GGATTTAGAG ATTACTTATA 
244551 AATGTATTTA TTTATTTATT TTTATTTTTT GAGATAGAGT CTCCCTCTGT 
244601 CATTCAGGCT GTAGTGCAGT GGCACAATCT CAGCTCACTG CAACCTCCGC 
244651 CTCTGGGGTT CAAGCAACTC TCATGCCTCA GCCTCCCGAA TAGCTGGGAT 
244701 TACAGGTGTG CACCACCATG CCAGGCTAAT TTTTGTATTT TTTTGTAGAG 
244751 ATGGGGTTTG CCATGTTGCC CAGGCTGTTC TCAGACTCCT GGCCTCAAGT 
244801 GACCCACTTG TCTCAGCCTC TCAAAATGCT GGGATTATCA GTGTAGGCCA 
244851 CTGCACCCGA CCTTGGAGGT TATTTGTTAT TGAACCATAA CTTTGCCTAT 
244901 CCTGACTCTG CAAGTGGTTT CCAGAAAAGA TCAATTCTTG AAGACCATGA 
244951 AGCCACAATT AGACCAGAGA CTTCCACTTG AGTGAATCCC GGGAACAATC 
245001 TGAAGCATAG AAGGTGTTAC AAGCAGGGGT CAATCATGAA GAAAGATGTG 
245051 GCAGGCAGGA GTTAGGTGAG GGTCAAGTCA AAAAAGCCAA CCATTAGAAA 
245101 CCATATTCCA GTTCCAGTAG GAAATGCAAA TGACTCAGTG GATGGGTCAT 
245151 GAGAAGACCA GAACACAGAT GATAGGTGTT AAGATGAGGA GATGATCCCT 
245201 CAATGGTCAT GGAGGGATTT GAATTTTCTA CAGTAGATTT CAAAAAAATC 
245251 AGGGAAACAG CCCAATATCC TTTCAGATAC CATTCTTATT GACTAGAGGC 
245301 CTGGGCTTGC CTAACTTTGC AATCTGTAGG TCATCACTGT GACCAGCTAG 
245351 TTAGGGCTTG GGCAGATGGG ACTTGAAACA AAGAGACGGA AATTGGTAGT 
245401 AGAATCCATC TGACACTTTG ACTTTTTTTT TTTTCTAATG GGATATTTTC 
245451 ATTAACCCCA TCCTAAATTT CTATTTCCCG GATCAAATGA TAGCTGGTCA 
245501 AGGAACATCG TGGGTTATAT TTTCTGAAAG CGTTAGCCAT CTTAATTTTT 
245551 TTTGTGGGAA AGGTGGAAGG GACTCACTCC TAACTTTTCC CAAACTGTTT 
245601 GACAACAACA TATTTTTATA CGGTTATCAT AAGTGTTGAT TACATTTGAA 
245651 ATTGGTCCTT TGGACTAAAT TGTCTAAATT TTCTAGTTCA GTATGATTTA 
245701 CTGTATAGTG TTTCTTCAGC AGATCTAAAC CTTAAAAAAA ACTACTTCCA 
245751 TGTGACAAAC CGTGATGCAA TTTTGCAGCT GTTTTCATTC TAAACATATT 
245801 GAGCCCAATA TATCCCAATT AGAAGAAATG TATATGAATT ATCCTTATTA 
245851 AGCGGTATTT GTAACTCTCT AGCACTTGTC TAAATTTCTT TGGTCTCTAT 
245901 TTTCTTCTAG TTGTACATTC CAATTGGCTA CATTAAGATA CAGCAGATTT 
245951 TACAAAATAA TGAACAATTT ATACATGTTT TAAATAGAAA TGTTTAAAAA 
246001 TTACTATGGT TTTCATTTCC TTGATTCATG AAATGATAAT ATAACAGAAA 
246051 GGGAAAACAA AATCCAAAAT GAGATTTGGG . TCATATGTTT TGCTCTGAAC 
246101 ACTTGATTTT TCTTTGGAAC TTGGAGTTTG ATATAATGTG GAGCCTGGAA 
246151 AGGGAGCATA JTGGTGTATAC GAGGCACTCT TTTCTCTTAC TGTGATTTTT 
246201 CTCTATTTTC AAAATGCAAC TTTGGGAAAA GATTTTAGGT GGATTCTGGC 
246251 CACCTACCTA TAAATGGTAG TGTCAATATC TTTCTTGGAT CCTGCTGCCA 
246301 TCCAGGTGGG ' GGTAGAACCT TTTTGAAAAA TTGTAGAAAA GGGGTGAAGT 
246351 CTCATGTGAT TATAGTTGTA GAAATCCAGC AACTCATACC CTCTTTGTGG 
246401 TAGGCATATA AATGCTGTCT CTATCAATGT GCCTGGAGAG GGTTCCTTGA 
246451 AGTACTAGTG GCTCCTTCCA TCCTGTACAG TTCTTTGATG GGGATGGGAT 
246501 ACTCTGAACT CTTGGACTGC TCTCCCTCCC TGGGCTCCCC AGATAATTTA 
246551 GTTTGGAAGT TTCTTAGA^A CCTAGTACAA AGTGCTATCT AAGAAAGGAA 
246601 TTAAGCCAAT TCTACTGGTA TAGAACATTC ATTTATTCAG AGATTATCAA 
246651 ACTGTAAACC ACCTGATGAA ATGTTCATAG GTTTTTTCAT TGAGTATTTT 
246701 GTATGTTTTT TACTTCATTA CTTTCTTCTC TTCTCTTTAT TGTTTCCCCC 
246751 CTATAATTTA TTTGAGTTTA TTTTGCCATT CTTTTTCCAA CTCTTTGATA 
246801 TGAATAATTA GCTTATTAAT TATTAATTTT ATGCTTTTCT GCATTTCTGG 
246851 TATAGGTACT TAATTATATA CATTTTCCTT TAAATATTGA TTTAGCTGCA 
246901 TACTGCAAAT TTTCTGTGTT GTTTTCATTA TTGTTGAATA CAAAATGTTT 
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246951 AAACATTTTT AATATACTCT TTTTGTGATT TATGAATTAT TCAAAGCAAT 
247001 GCTATTCAAG CAAAATACAA GATAAGCCAC AAATTCCAGT CACATTTGTG 
247051 ATTTAGAATT GTAAACTTCA AATCATGAGA TTTGTTGGCC CTATGGCTGC 
247101 CTCTGGTTGT GCTCAATATG ATGCCTGGAT ATTGCTACAC CAGATTCTTG 
247151 CACACAGCCT TTTGCCTGGT TCTTATTGGC TCCATCTCTT TTCCCATGAT 
247201 TCTCCTGATT TCTCCAATAC TCTTGCCATT AACTTTTGTG GCTGCAGACA 
247251 GTCTGATCTT ACTTCTGTCA CATCCTTACC TCCACTTTGC TACTCAGAAC 
247301 TATAACCTTC TGAAACTTAG CCATCCTCAA ACATGTGCAG ACTACTTTGT 
247351 CAGAATGTTT AGATCCCTTT TCTGTTTTAC AAGAGAAAAT ATGTGTTTAT 
247401 AATTATGTAT AAATTAATTT CACCAAACAT TAATTTATTT CATTTCACAT 
247451 TAAAGTTTAT CTGTAATATT TTAGGCTGTT CATAAAAGCC TCCTGAATGT' 
247501 AGTTCGTTAG TTCAATCTTA TAAACCTAGC ATTTCATTAG TAGGATACAT 
247551 TTCTGATGAA TAAATACTGT TGGTTTTCAA CATCAAACAC TTTGCAACAA 
247601 AAGTTGGAAA AGAGTGTCCT GCATTTACTC AAGACTACTG GATCTTATTT 
247651 CCAACCAGGA TATATTTATT CTAGTTAGTG TTGATGTGAC AATGGGGACT 
247701 GGTGACTCAG AAAACTTAAT TTACAAATAT TTATAATTTA TTTTGCCTAA 
247751 AATGACATTA GGGACAGAGA GGTGACCATT TGGGGAGTTA GACTGGCATG 
247801 AGCTAACTAC CAGTTTGTCC CAGGCAGTCA AAAAATACGG CTTCAAGGAA 
247851 AGCAATGGTC AAAAATAAAG GTTGAAAATA AAAACATTGT CTAGCTTTGG 
247901 TGCGATATCC AAAAACTACA ATAACTATTG TATTTGAAAA CTTTTTAAAT 
247951 ATACCTTAAG CTTTCTGCCA CCTTACAGAA AGGACAGTCT CTTGGCTGGC 
248001 TGTTATGGCT AAATTTTAAA AATATCCATA GGGAATATCA GTCCTTCAAC 
248051 CTATAGATAT AAAACCCCCA AATTCTGAAA CTGCCAACAG TCTTTTTTTT 
248101 TTTTTTTCCA CAGCATTTAC TTGTGTTTTG ATTGCAAGAA GTACCCAGGA 
248151 AGGAATAAAA TTTAGTAACA GAATGAGGAT GTTATGATTT TTTTTTAAAC 
248201 TGGGTTAGGG GTGGGGAGAG GGAGAGAGAG AGAGAGAAAT ATTTTGCACT 
248251 vGGAAAAACTA TAATCTGGAG AGACAGAGGA GTTGAGAGTG CCCCTTTAAA 
248301 CTGCAGGACT GCAATTGATC TCTGTATGTT TTCTGGTTTT CGTCCCTTTG 
248351 AAAGGGGAAT GAGAGCCAGG ATCATTGGCT GTACTGTGGA GCTGTCACAT 
248401 CTCCATACTG AAAAGGTACA CTGCAAGTGA ATTTAAACCG TTTTTGGCTT 
248451 TCTATTGCAT ATTGCCAAGA GCTTTGAGAC TGATGAACCA AGTAAATGCT 
248501 CTATTTAGGG CTAAGTGAGA TACAGATTCC TCCAATCTGA TAGCGTTTCA 
248551 GCCTCCGGAG TGAGGAAGCA GCAGAAACAG AAGCAGCAGA AGCAACAGCA 
248601 GTAGCAGCGG CAGCAGCAAC AGCAGCAGCC CCTACTGAAG TCCAATAGAG 
248651 GAGACTTGAT CTCTAGTTCA TTCTGGAACT CCGCCTGGGA TTGTGCACTG 
248701 TCCAGGGTCC TGAAACATGA ACCAAACTGC CAGCGTGTCC CATCACATCA 
248751 AGTGTCAACC CTCAAAAACA ATCAAGGTAG GATCTGGTTT TTCCCTCTGC 
248801 TTCTGCACAT GTGTCTTCAT TCATCTACCT AACACATGGG CATTGATATA 
248851 TCGGGGTACT GACAGTGGAC ATACCTACTT GATTTGAGTT TGAATTGAAG 
248901 GTTGAGTCAC AATAAAGCAT TTGGCTCTGC ATGTGTGTGT AGACTGGAAA 
248951 GAATAGTTTC CACTGAACAC TTCTTTCATC TTCATGACTT TTTTTTTTTT 
249001 TACTGAGAAT CTATCAGTGG TATATTTCCA GAAGAAAGTT TTGTGAGTGT 
249051 CTAGAAGAGT TTCCAGTCCA TGCCAATTTT CTTGATAGAA CTGGGGAAGA 
249101 ATTTGCAAAC TGCTAGAAAT AAATAGCATC AAATTGACTA CAAATCTGAT 
249151 AACAGGATTT CATTGTCATA TGAATTTTGG ATCTCTTATC CTTTTTTAAA 
249201 AAAAGCTCAT CCTTCATTGT GAGCATTGAA CTGGTGGTTA TACTAAAGTG 
249251 ATTATAGTGA GAGAGGATGA AAGACTAGAG AGAGAGTATA AAATCACATG 
-249301 TCTTTATTTA TATCACCATG' TAACTCAAAA CCAGTGAGAT GATTGGAGAC 
249351 ACTCTTCAAT GCAGGTGTTG TTTAAAATTT GAGGCTTTTC TGACAGCACA 
249401 TGGTTAAAGG TAAAGAACAA ACTCACAATT TCACTCTTCT GTTTTAGTTT 
249451 CAAGTCATGC TTATACCATT CTCTTTGTTT TTATTCGGGA TGCTCTTATT 
249501 TCCAAGAAGG CTTATAAACG ACCAGACTTC CAATCAAAGT TTACTATATT 
249551 TTGCCATTTA CTTGAATTGT ATTATTATCT ATGTGGATAT TTTGCATCTT 
249601 GAGGGAAAGT GATGGATTCG TCTGAAACTC TTTTATGAAA AAAAAAAAAA 
249651 ACTAAAACAA AAAACGTGAT ATCCTGACAG GGCATCTTTA GTTACTGCAA 
249701 TCTCTCATTA ACATACAAGT GCATACCTTT TGCTACTATT CCTGATAAGC 
249751 CTGACTTTTT ATGATTTATA TTATGTTTTG GTTTATGTCT CAGGCCTTTA 
249801 ATAATATTGC ATAGACTGTA ATTACAAAAG GAATATAATT TTCAACAATG 
249851 TCTCTTCTTC ATGAATTTAC CTACCGTTTG CCAGAGTGTT TCAGTGATAG 
249901 ATAAACCATA" AACTAAACAT GTTCCTCTAT TTATTAGCAG TGACTTTTAA 
249951 AAGCAATGGG GTTTCCAGGG TATATTAAAT GATGCATAAT TAGATTCCCA 
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250001 TCATACATAA ACATAGGGAG 
250051 GATAATTTAC ACCACCTCTA 
250101 AGATTTTCTG TGCTGTATCT 
250151 GGTATAGTTT TCCTCTGTGT 
250201 GCATATCACT GAACAAAGCC 
250251 GGATATATAA GCTCTAGTGG 
250301 GAGTTACAAA CTGGCTGCCT 
250351 AAACCAGTGG CTTTCTGTTT 
250401 TATTTTCCCT TTTCTTTATT 
250451 GACTTCAAAT CACTTCCTTT 
250501 CAAACTGTGG AACAGAAAAA 
250551 CATCTTGCTG ATCTTGCAGA 
250601 GAGTTGAAAT GAAAAGCATC 
250651 TGTATTATAT TAATGGCTTA 
250701 AATTTGGTTA ATTGTACAAG 
250751 ATTGCTCTGG GGAGCAAAAA 
250801 GTCGCAAGCA CTGAACTATT 
250851 TTGCAATAGT GAAAAGGGAT 
250901 GTGCTCAATT ATGAAACATT 
250951 CTATCTATAT TTAGGTAATG 
251001 TCTAAAGAAA AACTCTAAGA 
251051 GGAGCTCAAA ATACTCTTTG 
251101 TGTTTAAAGT TAATTTTATT 
251151 ATCAGATAAT TTTAAGTAAC 
251201 GTAATGATAA CCCACTTTGA 
251251 TCCATTTTAG AAATTCAAAA 
251301 TGTAATAACC TCTAGAGAAC 
251351 TGTTTCTTCT CTCAACAGTT 
251401 TGAACAAAAG TGAAATGCAT 
251451 GCAGGAGGAG GAAAAACAGC 
251501 TGTGTGTGTG TNNNNNNNNT 
251551 AGTCTATAGT GGGTAAAATT 
251601 GGCCCATGAT GTGTCAGAGT 
251651 TGGAACAAGA TGATCAAGAG 
251701 GTTCTGGCCA TTCTTTAGCT 
251751 TACTTTGGCA GGTATGTTCC 
251801 GGAGTCAAAT AGGGGTGCCA 
251851 GCTGAATGCA ATTCACTCTG 
251901 GGAAGAAAGC TTTTCCTGAT 
251951 ACATTCTTTT TAAAAAAATC 
252001 TAGGAAGTAT CTGGGAATGC 
252051 AAACGTGATT TCTGAAGAAT 
252101 TTTCCTTCAC CGTTATAGCT 
252151 CTTTCAGCTA AATCTCTGAT 
252201 CAAACTTAGA ATTTTGCCTC 
252251 TCACATTTGA ATATCAGTCT 
252301 AAAATCCTTC ATCCTTTTTG 
252351 GTGGTGATTC CTGGATCAAG 
252401 TCTGAAATAC AGTACACAAT 
252451 TCTTCCTCTT TCTCTTTCCT 
252501 ACTGCCTTCA CACCCAGGTT 
252551 TCTAAAAGTT CTAGTTAGTT 
252601 ACTATACTTA AACGAATCTT 
252651 TGAGCTGGCA AACTTTTTCA 
252701 TATTGATGAG TCATTGTGCT 
252751 ATCCATAGCC TCAGCCAAAT 
252801 AAAGAGGATG AAAATCTTAT 
252851 TGGAACATGG TGGTTAGGAA 
252901 GTGTGACATT GCCATGAATT 
252951 TTGCAATCTC ACTGGTTGTT 
253001 GAGTCTCAGC TCACTGCAAC 



TTAACTGAAA TCATGCAGAA CTCATATTCA 
ATAACAGATA TGAGAATCTT TTTCAGCATT 
AAGTTTAGAA ATAAAAAATG TTGAGCTTTT 
ATCAGTTTCC CACATACATC AATTATCTAA 
AAAATAGTAA ATACATGACA CTCTACATTG 
GAAATATGTA TTGAAGCCTA AAGTGGGATA 
ACTGCTAAGT ATGTCTAGAA TGCTATAACA 
AAGAGTCTGA GGTCACAGGA ATCAAGCATT 
GAGTAAGCAT AAATGACTAC TACTTTCAGA 
ATTAGTGCTC TGGCATTGAC TGCCCACAAA 
AAAAATGTTT CTAGTGTAGA GGTTTTACAT 
TGCTGTTTTT TCCTTCTAAC TGCAGCCATT 
ATAAAAATTG GACAACCCAC CACTTAACAA 
ATGAAGGAAT CTTGAATCAA AGCTCTGGGT 
TTAAATTCTT TTGTGCACCC CATTCCCAAT 
AATGCTTGTG ATGTTCATAA AATATATATA 
AAGGGAGTAA TTACTTGAGT ATTTTCCAAT 
GAGTGGAAAA TCTCTCTTAG TTCAAATTAT 
TATGGCTTTT TGAATTTATA CTAAAAATGG 
CATTAGCCTA TGAACAAAAT GCAATAACAA 
TGAGAGGGTT TTGGGAAGCT GTGGGACATT 
AACTATTAGA ATTTTAGTTT CAAGCATGGT 
CCTCACTCTT AAATTTACTA TCAATTTACT 
CCTCCCTCAT GGAATCTGGT TGGCCTGCAG 
GGATCTGGAA ATAAATAACT TGCATATTGC 
ACCTCTAGAG ACTTTTCTCC CAAAGCTCAG 
TTTTCCCCCA TCAGTATAGT AAATGCTGAC 
AAACTACGGC TAAAAAGAAT GCAATGGGAA 
TGCTAAAATA GCTGTAAAGG CCAAAAGTGT 
TCTGTGTGTG TGTGTGTGTG TGTGTGTGTG 
CTCCTCCACT TGAAGTTCAA GAAGTGATGA 
CAGCAATTGT TGCCAGTGAA GGCTGTTTGT 
TGGAAAAGAG GAAATTGGGA GAAGTTATAA 
TATTCAGGTT GCTTGACACT TGCTTTGGTC 
CAGACAGTTA CTGACACACT TTTCCTGGCA 
TATCGTCAAA ACCAGTAAGG GAGAGAAAAG 
AGTGTGTTCT CAGATCACTT TGGGTGAAAT 
AAAACTTTCC TCCATTTCAG AGCTTGTAAG 
GTGATTACAA ATCTTCCTGT GAAGGTAGCA 
TTTTACTCTG AGATGTTTAA TGGAAAGTTT 
TAAATAGAGA TCAGATCTTT GCGTTATTAT 
GTCCCCCACC ATGGGTAGCA AACTTGAGTT 
TCTCTTGGCT GCACTGAAAG TAAATTAAAC 
CTTGGTTTAC TTCTTTTGCT AGTGGAATTG 
TAAAATGTGC AGATTTTTTT CTTATTTCTT 
AGTATCCTTG TGTGTCTAAA ATGAGTAGAG 
AAGTTATAAA TTGTGAGTAA CTTATCTGGG 
AATTATATGT ATTATACTTT ATATGGTAAC 
ATTTCTTTAC CCTTCCCATG TAGTGACCGG 
TTTTACAAGA CCTGCCTGAT TTATTTTAAT 
TTCACTTCCA TTTAGAAAAG AGTCTCAAGG 
CCTACATAGC CCTGCTGCAA GTTTTTACAA 
TAAGATTTTC ATTGTCTTCG GAGTACATCT 
TGGTGATTTT TACTGCCTTT TATATTGTAA 
AATAAAGACT TTTCCCCCTT GCTACTTCTA 
GAGGCATAGG AAACAGAAGA CAAGGGGGTA 
CACAAGTATG TATTGTTAAA TACGTAGGCT 
ATAATTTCGT TAGCAAATTG GTATTTTGGA 
GGCATGATGA GATGGCTTGG AACTACAGCA 
AAATTTTTTT TTTTTTTTTT TTTTGAGACG 
CTCCGCCTCC CAGTTTCAAG TGATTCTCCT 
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253051 GCCTCAGCCT CCCGAGTAGC TGGGACTACA" GGCACCTGCC ACCACGCCTG 
253101 GCTAATTTTT GTATTTTTAG TAGAGACGGG GTTTCACCAC ATTGGCCAGC 
253151 ATGGTCTCGA TCTCTTGACC TCATTATCCA CCCCTCTCAG CCTCCCAAAG 
253201 TGCTGGGATT ACAGGCATGA GCCACCGCGC CCGGCCCACT TGTTGTTAAT 
253251 TCTATACCCC TAACCTAAAT GCTGGACACT GACAAACCTG CGTTTCACAT 
253301 AGTACTTGAA CTTGAATTGT AAGTTATTCT TAAAGGGTAT TTCATAATTA 
253351 AATATATAAA TTATATCCAC AAAAATTTAG AAATGCGAGG AATGTTTAAT 
253401 AACAAAAAAT CTACCTCTGT GTGTCATTGC AATATGCAAA ATGGAAGATT 
253451 CTCCAATGTC ATTATTACAG TATATGAAAT TCTAAGTGGA AAGATTGTTA 
253501 CATACTTCTA CATTGTACCT TATCTTAGAG AGTCTTCCTA TTCCTAATCC 
253551 TAAGAACTTT ACTCATAAGA ATACATAAAA AGAAACTTTA TAAAATTATC 
253601 GACTCAACAT AGATTCATAC ACACTAAAGA AAGAGTTACC ACTGGTAACT 
253651 CTACAGCAAC TGATTTCACC AGTGAGTTCT ACACTTCCAT TTAGCAAGGT 
253701 ACAGATTTTT CTTCAGCATT AGGATGAGTC ATTCTAAGCG GTGATAGCTA 
253751 AGGAAATCAA AAGAAAAAAC TTACAAAAAT TAGAAATAAG GATATTGTAT 
253801 GTGTGTGAGA GAAAACTTTT TTCTCTTAGC AATGGTCTAC ATATTTTTAA 
253851 TTACTCTGAC ACAGCTTCCT TATATGCTTA AGGCAAAATC ACAGCCTGTG 
253901 ATAAACAAGA ATAGGACCAT CTGCTAATGA CAAATTTATC ACCACTTACA 
253951 ATTTCCATTT ATGGCCAAGT ATTTAAATCT TAAATAATTA TAGCACAATT 
254001 TATTTCAAAT TACTATAAAT TCAGCTGTTT TTAAAATGAA GCATTATAGT 
254051 TTTTTTTCTT TTTAATAAAG AGTGACTATA ATAATTGGCC AGCTTTACAT 
254101 TTTATAGCTT AGATCTTCAT AAAATCAAAA GTTTCTTTTT CCTTTGACAT 
254151 TTATAGTGTT TGAATGTCAA CTTTTACTTT GAAGAAGTAT TCTTGTATAA 
254201 TATAATTTCA AGGTAAATTT AAACAGTTTT ATGAAGTATT TAGTATTTAT 
254251 TTTATTCTTG AGAAAGTAAT GTGTTAATAC TTTTCTAGAA ATTCATGTTT 
254301 TGGTCAGTTA GTGAAGACAT TCTTGATGGT AAATTTGTGA TGACAAATTA 
254351 CTAACTCTCA CTGAAAAGAC ATTATGGGTC TACAAATGAA GCATGGAGGT 
254401 AGTAAGCTAA GATAGTGGTT ATCAAAGTGC TGTCCCTAGA CCAGCAGTAT 
254451 CAGCATCTCC TGTGAACTCA TTAAAAATGC AAATTCTTGG ACTCTACTTA 
254501 GACCTACTAA GTCAGAAACT CTTGGGGTAG GGCCCAGCAA TCTTTGGTGG 
254551 GTGATTCTGA TCCATATTGA TGTTTTGGAA CCACTGAGCT AAAGAATTAC 
254601 AAAATAAGAC ACATCTGTGC TATTGTCAGC CTGTAAGAAT TGTCTGAGTA 
254651 CCACAGAAGA GGTCTAAACA GCTTCAGAAT GATACTTAGG GTTATTAAGT 
254701 AACATGAATT CTGTGACAGC CAAAATCTCA TCATTAGGTT TATCTCTTTT 
254751 ATTCCTCACA GGGATTACTT GATACTATGC CTTTCACCCA GTAGACATTA 
254801 AATATATATT TGTTGGTTAT TAATGACTGT TAAGGTCATT TAGTAGAGAT 
254851 AATGCTCTAA GAGGACCTAA ATGATATTGT AAGGTGTTAC TCAACATGTA 
254901 AGCAGGTTAG AGGCCTTGCT TTTTGTTTTA AATTCATAGA AAATTTCTCA 
254951 AAGAAGTATA CTGACATTTT GGCTTTTGAT GGTTTTGTTC TCTGATCTTG 
.255001 GTTTGGGGCC ATCTGGTAAT CAACACATTT TGGTAAGTAG GGGGATTTAG 
255051 AATCATACTA ATTTTTTGTC CAGAAATTGT GCTGGAGAAA TGTCTCCCTT 
.255101 CTTGCCAGCT GTATCTGAAA ATTGGCAAAC AATGGTAGGG AGGAATATCA 
255151 TCACATTAGC CTOTCTCCTT GCCTTTATGA ATGTTCTTTG GCCCGTAGAT 
255201 ATGTGCCAGG TTGGTAGGGC ATAACTGGAA GTAGCACACA ACTGGAACAA 
255251 CATGAATAAT TGTGAACAAA GCTGGCCTTC CACGGACAGA CACAGCATAA 
255301 TATTACATTA GCAGAACAAG ATTTAAGCTG GGGGATAAGC TGTGAAATAA 
255351 GGAAATTACC TCCAAACAGG CTTATTTCTT GGAAATTCTT CATAGCACTC 
255401 TGATGTTTTT CTTTTTTTTT TTTTTTATAG TTACCACTGG GAAGAAAAGT 
255451 AAAATGCATA CAGCCTGCCC TACTACCTGC AAACGGATAT AAGGCAGATC 
255501 GGATGGCTTT TTCTTTAGAG GGGCATCCTT AAGTGGAGTT CTCACTCTCC 
255551 CTTTACCACA TGAACCATGT GGTCTTGCGG AAGGCAATTA ACCCTGTTAA 
255601 GGATTTCTCC CCCACTAACA GAAGGGCAAT AAACTTAAAT TATCTGCCCA 
255651 AACAAAAATG CTATGTAAGT GCTTTGTATT ATGATTTGTT CTAGTAGCAA 
255701 TTTTTAAGGA TACGACATTT TCATAAGGAG TTGTATAGTG AGAATCAATT 
255751 TTTATGACAT AAAAGCTGAG AATTTTATTT GCAATTATTA TTAACTATAA 
255801 TGATGATAAA AATTCATTTT GAAGTCACCC GGGGGATGAA AATTATCTTC 
255851 TAATATTGTA GTAACATCTG CAAGATTTCA TCATGAACCT TTACCAGGGA 
255901 GGTAGGGGAA GAAGATTGGA TTCAAATGTC ATTGAAAGAA . CTGATTAATT 
255951 CAGTAAAAAA ATGCAGTTAG TCATCCCAGT GCTGTGCTAC CAAAATGGAA 
256001 AATACCTGGG TAGCTGCATT GATGGTACAT ATGGTTGCAT TGAGGAGGAA 
256051 TTGTATCATG GAATAAATAT AGGCTTCATA TGATCTCTGT GCTTCTAGAC 
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256101 TTGATACTTC CTTTGCCTCA CATACGTTCC CCATCCCCCA AACCCCAAAA 
256151 ATGGTTCTTT TAGCAAACTC CTATTCTTAC TTTAAATCTT ATGCAGAATT 
256201 TTCTCAGTGA GAATTTTCTC ACAATCTCCT AGTCAGACTG TTATGTTCTT 
256251 ACCTAATCGC CGATGTTCTC ATATAGTCTT CTGTACCTCT TTAATAACAA 
256301 TTAATCTTTT GAGTTATAAT TATTTGTTTC CATGTGAGTC TCTCTACGAG 
256351 CTATTAAAGT GCTCTGATTT TTTTTTTTAC TTTTGTTAAT CTCAGTACCT 
256401 ATCCTAATAT CTGCCACATA GCAGGAGCTC AATTAATATT GTTTAAATAA 
256451 AAGAACAAAA AGAACAAAGA AAGCTATAAA TTCATTTGCC TTCCCAAGAA 
256501 ATGTAGTTTG ACTTTTGAGT TCAGAAATAT ATTGGTCTTT GTTGAGCTAA 
256551 GCAGAACTAA ACCTGTTACT TTGAAATATG CAAACAACAT TTGAAGGAAA 
256601 TAACAAGGTT CCTCAGGGAT TGTGAGAACA ATGTGCTATT TCTCTTGAGG 
256651 ATAAGTGGCT TCTGTAAGTA AAAAAAAAAA AAAAGGTTTT GCATTATGAG 
256701 CGTTTTTCTA CATTAAGGAA GTCCTGTGAC CAAACAGAGA CAACCCCTGT 
256751 TGTTCTTCTT TGGAGCTTGC ATTAGTATAG CAGACACAGG TTACTGCTGA 
256801 TTTCAATAGG TAGGGCTTTT CAGGGGGACT GTTTGTTCTG GCACTCTAGT 
256851 GAGGCTTTTT ATAATCAGAA ACCTTAATTT CCAAGATGAA TCATTGCTGA 
256901 CGGACTGAAA AACACCACTT CTCCCAACTC TGGTTTCCAG AATTTTGCCC 
256951 TAAGAAAAAT GAGTTCCACT TCATTTTAAT GGAAAGTAAA TAGCATAAAA 
257001 TTACTTATTG TATTTAAAGA TCGGCCCTTT ATTTGATTTT TTTTTCTGAT 
257051 TCAGTGAAAG AATTAGGACA GAGAGACAAT GCAAGGCAGG CATTTTTGAT 
257101 CAAGGATGGA CAGCTATGGC AAATTTTTTC CTTTAGGTGT ATCCTCTGAA 
257151 GGAAATAAAA TAGAATGAAA GAACAAACTC AGAACAGTTG TTGGATGACC 
257201 AGTGAGCACA GCATGGTGTG AGGAATAGAG CACTGGACTC GTTCTCTCAT 
257251 TGTTACAGTC ATTCAATATT TCATGAAAGA GAATTCAAAT GTTTGGAGGG 
2573 01 TGATAGCTTT TGTTTTTGAA GAGCATGGCT ATGATATAAA TTAATATGAT 
257351 TACAATGTAA AGTGATGAGT ATTAAGAGGG ATAAATACTT AGGTTTGTCG 
257401 TTGGTGGGAA GGGAGGCATT CACAAAACCC AAGCCAGAAC TCTACATAGG 
257451 AGTTAGATTG ATAAAGAAAG GGAAAGAAAA TAGTTCAGAA ACAGAGAAAA 
257501 TGTGCAAAGA TGAAGAAAAA AGGCAATGCA ACTGTCATGA AAGTATTTAA 
257551 TAGTTGAGAC CAACTAGATT ACAGAGGGCA AATGATTAGG AGGGGTGAGA 
257601 AACAAGGCTG CAAATGCATA CATGGAGCTG GACTTTGTGT TGTTTTTCAG 
257651 GATTTGGCTG TACTCTGACG GCAAAGCTTA AATGCAATAC AGCAGGGACA 
257701 AGATCTTGGG ATCTAAATTC AGGACACAGT TCTATGACCC TGGGAATCAT 
257751 TCCTGAGGTT TGTGTCTTCG AGTAAAATAT AAAGATGGTA ACACGTTCCT 
257801 GGTCTGTCTT TCAGGATAAT CCAAATATAA GATGTTAGTA TTAGCCAGAC 
257851 AGGTTTTACA GAAGGTAAAT CTTAACAAGA ACAAAAGGTT TGTTCATTAG 
257901 CTTCATTTAG AAAGGTACCA GCTACTTTCT TGAAGGTTAT TAATAACTTA 
257951 TTAATATACG GTCTTTAAAT ACTAATTTAG TGGTAGCCTT ATAATCTAAA 
258001 AGTATCTTTT TTGACTAATG CCAATCATTT AGCTGTACCA AAATTTTCCT 
258051 CTTTAATATT TGAAAATAAT ATACTGACCA TCATTCTCTT GCATATTTAT 
258101 TGCTGATTCA TTATTTCAAC AATGCCTAGT AATACATGAA TGGCTTCTGT 
258151 TAATAGCATT GTGTATAATC TCGAAGAATA CCTGGAAATT TCAGGTGTAG 
258201 AGATGTGACA ATTATTTTCT TCCTCATTCT CTGATTACAT ATCCTATATG 
258251 TTGTCTTTAA TAAAGATACC TTGTGTTTTT TGAAAGATCA GTTCCAAGTG 
258301 GTCGGGGGTG ATCTCTCTTG ATGGCTCACA CCATTTTCCA GAAGATACAG 
258351 GAAAAATTAA AGTGCATTTG ATGTGCTGTG TTTCTATAAT CATAAAACTT 
258401 TAAATATTTC TTAGAGATTT CCTAGTCTAG ACTCTCATTT TGCAGACAAG 
258451 AAAGCCAAGG ATTAGAAAGG AAAAATAGGC AAACGTGAGC ATTGTCCATT 
258501 CGCTCCTTTA AACAGGAACC ACACTTATGA TCATGGTCTT TTGAGATCTG 
258551 GGGTAAGTTA AAGAACATGT AAATTTTATT AATTGAGATG CTTTTCATTG 
258601 AAAGCATCAG AAAATAACAG AAAAATTTAA ATAAATTGGC TAAAAGATTA 
258651 AGAAATTATA ATTTCCCAAA CTGAAAAAGT CCCTTAGTAG AACACCTCCA 
258701 GGGTCAGTTA AATTGATGCC CAAATATCAT GGCCTAGAAC CCAGCCTCTT 
258751 TGTGTCTTTC TGCTTTGTTT TTCTCAAACT AACTTCTGCC ATGGTGATTA 
258801 CATGGTTTCT TCAGTTCCAT AATTGGCCTG CAGACATGAC AACTGACAAT 
258851 GTCCAGTAGA CAGAGTGGTT TCTTTCTTTG AGTTTCTTTT TAAGAAAAAG 
258901 TACCCCACAA CTCTTGTTGG TTAGAACTAA ATCTTGCACC AATTCCTTAT 
258951 CTAATTGCTT GATGATACAC AGGGTTGAGT GGAGAAAATA GAAAAGTAGG 
259001 TGACACCAGT AAATATTGGA TCCTGTTGAA AACCATCAGG GAGATTGGAT 
259051 GGTTTGAAGG TAGAAAACCA AAAATACATA CCACAGAGAT CAGCAAAATA 
259101 GATTTTAGAA TCTGACAGAC ATATAAGTTG CACCACACTG TGAAAAATAG 
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259151 GCAATGCCAT GATCCAAGCA CATTTTCAAC AAACGTGTAT TAATTTAAAA 
259201 TATAACATTG TCCCCGATTG CTTCCTTTAT AGCAGAGGCT TCTTTGATCA 
259251 CGTTTGAGCA AAAGGGCAGG TAGAAAGAAG TCTCGTCTCC TTTCATATTA 
259301 AATGACTCAG TTTCCAATCA ATTTTCATTG TGTTTTCTTT CCTTCATTTC 
259351 TTTTTTCTTT GTTAACATGT CATTGGACTT TCTTCTATTT GGGGAGCATG 
259401 AAACAAAATA ATGGTGATAT CCCTCATCAA ATAATTAGGG TTCTCAGGGT 
259451 CTATCACAAT TATGTCTTCT CTGCTACATC TAGGGATTTG AAGTTCTTAT 
259501 ACCTTCAGGA GCGTTGTTCT CAAACCTTCA TTGTAACACC TGAATGCCCT 
259551 CATGGTTAAC AGGAGTAACA TGAGGATTGT GGTTAGATAT GCAACAGTTG 
259601 ACACTGTGAT GACTGGGATG AGCAAGCACT CACCTAAAGA CATACCAGAG 
259651 CAAAACCCAT TCCCCCTCCA AAGAAATACC TTGATCTTTG ACTTTTGTTT 
259701 GCTATATTCA CAATCAAAAA , CTGTATACTA AATGGCTTCA TTTTATAAAA 
259751 GAGCCTTTTA AATTGTGTGT TCTGATATTT TGTCTATCCA TATGTTAATT 
259801 CAACAAATGT TAGTTGGATA TACAATAGAT ATTAAGTTGC ACACATGAGG 
259851 AGACCAAATG GGATAGCACA CAGAGCCTGT CTTTAAAAAG CTTACAGTCC 
259901 GACATCTTTT CTCTTCTCTC CTTGCTTGGG ATGAAGGAGA AAAGCAAGGA 
259951 AAAGTAAGTC ATTGATGACC ATAGCGTATG AAAAGTGTCA TGACAATGGC 
260001 AAGAAAAATG TGCTCTGAGA CAGCATAGAA GGGGCTCCTC ACCCCAACTC 
260051 GCAGATGAGT TTGGATCTTT GAACTGAAAC TTGATGGGCA AGCAATAGCA 
260101 GCTTGGTGAA GAATCGGGAG AGAAAAGAAT ACTCCAAATG GTGCTATATG 
260151 TGAAAGTACT CAGATAAGGG AGGGTACCAC CACTACCATC CTTGCTGTCA 
260201 TCATCATTAT TATTTCATCA CTACCATTGT TAAGTGCCCA AACCATGATG 
260251 CTTTTCTAGG TACTTTTATG AGATATCTCT AGTTCCTACA GTGACTTAGT 
260301 AGTAATTATT GTTCTTATTT TATAGATGAA AACATTAACT TGCCCAATGT 
260351 CACCCAGACT GTAAGAGGTG GAGATCATAC ATGAGTTCAG ATAAGCTTAC 
260401 TACAAAACTG AGCTGACTGC TCATGCTTTC TGCCTCCTAT TTGAGTGTCT 
260451 CCTAACTCTC AGGTTTCCGC CAGCTCTAAG GAACTAGTTT TTGTTACCCA 
260501 AGGTCCCCTG CTATTTTAAG CCTCTCAATT GAAGTTTCCT GTTTTCTCTC 
260551 ATGAGAATGC TTCCTGTTCC TTATTATCTG GCTGGTAGTT ACTGCTCCTC 
260601 ATTCGGGCTG AATTCACAGG GAATTTGTCT CATCTTGGAA GATTTCTCAA 
260651 CCCTTTTTTT TTCCAGCCTC AGGAAGACGA CTTGCTTTGT CCTTCCCTCC 
260701 CCCTGCAAAC AACATCACAT TTATGTCTAT GTATCTTTGT TATCCTATCC 
260751 CTTCACCCTC TTTCCTACCC TGTCTACCTT CCACTATATA TTGAATCCAC 
260801 CGCTAGACTG CAAGCTCCTG GAAGCCAAGG CAGCCTCTCA TTTGTCATTG 
260851 TGACTTCAGT GTCTAGCATT ATTCTTGACA CGGAATGAAG GCTCAAAAAG 
260901 AACTTGCTAA. ATAAATGAGT GAGTGAAGGC CATCCAATGT GGGGAATGCT 
260951 ATGTTACTGC TCCAGAATAT TCTTTCATAG TATTAATGTT TCATTTTATT 
261001 TGATATATAA TCACATGGCT TTTATTCCAG AAGACGATAT CACTATGAGA 
261051 AAGGAAGCTA AATACTTTGT TGGTTTTTTC AAGGGATTTT CTTTCTATTG 
261101 TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTTTGGG GGTGGTGGCA 
261151 TATGTATTTA AGACAATAAG TTAGAAGAGA TAAGAGGTGG GTTTTAAAAA 
261201 CGTAAATACA GGCCAGGTCC AATGGCTCAC GCCTGTAATC CCAGCACTTA 
261251 GGGAGGCCAA GGTGGACGGG TTACCTGATG TCAGGAGTTC CAGACCCACC 
261301 TGGCCAACAT AGTGAAACCC TGTCTCTACT AAAAATACAA AAATTAGCCA 
261351 GGCGTGGTGT CTCACGCCTG TAGTCCCAGC TACATGGGAG GCTGAGGCAG 
261401 GAGAATCGCC TGAACCTGGG ATGCGGAGGT GGCAGTGAGC AGAGATCATG 
261451 CCACTGCACA CCAGCCTGGG TGACAGTTCA AGACTCTGTC TCAAAAAAAA 
261501 AAAAAAATGT AAATACAGCT GTAGTTGGCT TATTGCTAAC TTATTTCATT 
261551 GGAGTTTGGC TGAAAAGGTT TAAAGTGATT TCAAGTTTGG AGAAGTTACT 
261601 TTTTTCCATA ATAAGAAGAA AATATCTTCT AGGCACCTAC TCTGTGCCAG 
261651 AAAGTCTCAT ACCCACAAAT TCATTTGTTT TTTTTTTTTT TTTACCATTT 
261701 TCCTGCTCAG TAAAGTAGTA AAAATCAACA GCTAACCAGC TGACCTTCAT 
261751 CTAGCCCTCT GTATTCAGTA TTTCTTCTAA TATTTCCAGA TAGGCATTAT 
261801 TCTCATTTTA CCAAAAAATT TTAAGGGCTT AATTTTTTGA ATGCAAAAGA 
261851 GGTAGAACTG GGCCTACTAC TCAGTCGTGC TTTGACTCCT TATGTCATGG 
261901 AGGCAAGCCC GTTGCATCAC ACCACATGGC AATACTGCTT ATGAAAATAT 
261951 CTAACTTAAG GGTACTTCCT TCTAAAATAT ACAGGCACTT GACTATCCCA 
262001 TCATTATATG CTGACAGGTG GGTCTTTTCA AAGTCTCACG TTTTCACGGT 
262051 CATCACCCCG CTCAAGTCCT TATCATGTAT TACTAATGGA GACTGTAAGG 
262101 GGAGGCCCAC CTGGAAACAT CAGAACTGGG AACTCAGACC TCAGTGAGGC 
262151 AGGAGAAACA ATAGAGAATC CAGAAAGCGC ATCTAATGGG AATACTAAGC 
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262201 AGGACATAAC ACCATAAACT AATTTCAAAG GATAGCTTAG GGTCTGGGAG 
262251 AGATGGACTA TTGAGGAGTA AGGACAAGTT TGCACAAATC TGGATGACCA 
262301 GTGATTTTAA GTCCATTTGG ACTAGTGACT TCTTAACATT AGAGGCCCAG 
262351 GCAGAGGCAT CTGTAGCAAT CTTGGTCTAG CCAGGATCAG GGTAAGAATA 
262401 TTAACAATAG CAACCATGTC AGAAATATAT ATGGGATTCA AGTATTTGCA 
262451 AAACATTTTA GATTTTTTTT GTGTTGAATG TCAACCACAG TCCTATAAGA 
262501 CACACATACT TATTATCAAC AGGATATGAT AACATGAAGA AGGAAATGTA 
262551 TGTTGTTGAA TAAAAATAGC AGTATATAAT GAAGTATATA TACACTGTAT 
262601 AGTTTTATAA ATTGCAACAC ATTTTGGAAA GGCAACTGCT TAAGAGAGTG 
262651 GTTGTGATTA CACGTACAAA ATATTTCAGG CATTTAAAGT ACTGGGCTGA 
262701 ATTTTTCCCA GTAATTAAAA TAGGGCATAA GAGGCAGCAT ATGTTCTAGA 
262751 GATCACTAGA GTACAATAAG CCAAGGGCAG ATATGAAGCC AGCATAAAGT 
262801 GGGGAAATAT AGGAACAAAT ATCCTATTAC AGTGTGTGTG GTCTATTCCT 
262851 GGACACTGTG GTTATAGAGT CCAAGTTCTG CTGCCAATGC CACAATAAGA 
262901 AAGCCATGGA CAAATGTAGA AAACAGTTAA TAAAAGTGAT GCTATCTATG 
262951 CCTGATCTAA CTCCATCTAG TGAGAAGTTT CGAGAAAGAG TTTGAGTTGA 
263001 GTTTGTGGAA TAGATTAAGA GTAAGATCAA GGAGGATAGA CTTGGATATA 
263051 TAAATAAAAA CTGAGTTCAT TCTTAATGTT CAGGAGAGAA CAGATCCTGC 
263101 GTCTGTTGTA TCTTACGCAG ACTCTGTTGG TCAATACAAA TAAACTATAT 
263151 CCATTTGGTT TCACTTGCTC TAACTTGGGA TTTCTCAACC TCAGTGCTAT 
263201 TGACATGTTG GGCCAGACCA CTCTTTGCTG TGGGTCTGTC ACTGGCATCG 
263251 TAGGATGTCT GGCAGCATCC CTGGCCTCTA CCAGTCAGTA GCCCCTTCCC 
263301 CAGCTGTGAC AATGAAAGGC CTCCAGACAT TGCTAGTGTC CTCTCGGGAG 
263351 CAGAATTAAC CACTGGCTTA CATGGTAGCT GTTGTTGCAG GTGAGGAAAC 
263401 TCCGGGACTA GGAGTAGGTC TGGGTCATTC TGGTCTCATG GCTTCTGTGA 
263451 ATTGGGAGCC ACTGTAGCAG TCTATTTTTG GGCCTTTCTG AAACTGAGGA 
263501 TGGTTGAGCC ACCCAAAGAA CACTCAGAGG ATGCAGCATT TGTCAATCAG 
263551 GATTTACAGA TGGAGGAATC TGGGCGGAGA GTTAGTAGTC TGCCCCCAAA 
263 601 TCACATGGAA GAACTGATAT TTATACCCCT GGGATAACAT TCTATACCCT 
263 651 GTCTCTTTTC TCTATAACAA GTGATTTTTA TTTTATGCTT TGCAAAGCCC 
263701 CAACACTTAG GTATTGAGTG AGAGGCAAAG CAGGAAGGAC GCTCCTGCAT 
263751 TTCTGCTCCT ATCCCTGCTC CTGTCTTGGC CAAAGTAGCA CAGCTATCAC 
263801 CATTCCATGC TGAGGATGGT TAGTGTTCCT TTTAAAATAG TAACAAATAA 
263851 TGTTCTCTGA ATTATTCCTC TATGCCTGCT TGGAACAGAA GACAGACCAG 
263901 TGTTAACCCA GAAAGCAACA CAACAATATT CCTTATGTGC TGTCGGAACT 
263951 CTCTTTTCAA GGGTAAGGCA GGCTGAGGCT GCAGGGTAAA TCTGAAAATC 
264001 AACAAGGTAG AAAGATACAG AGACTGACTC AAAAAGACCA TTTCATCATC 
264051 CATTCCTGGT CTCTTTCCAG AACAGTCATC TTCTTGCCTC CATATTTCAG 
264101 TCTTCCAGGC TATCCTGTGC TATTCTGCCA GAGTTTCTTT CCTGGCAAAG 
264151 CCTTAACAAA TAGTACCCCG TTCTTGAGAA TCTGAATTCT CATCCTGATA 
264201 GCATCTTGCT TGGCAATCTT CAAATTCTAT ATTAATCTAG GAACTTCTCA 
264251 CTCTACCCTT TTTCATAAAA TAGGAAATCA TCTCAGGGTG GGAGAATCAC 
264301 AACTTGACCA CTCATCTCTC CCCTATTTGC CTGTCATGAA ATCTAAGATT 
264351 CTGCATTTTG AGACTGATTT TTTTTTAAAT AAAGAGGTTG ACTGCTCTAG 
264401 TTATTTTCTT AAAAGCAGAT GAACTTGAAA ATTTTGACAT ATGTATATTT 
264451 TGTCAGCCAA AGGCATACCA TCTATTTCAG GAGGCAATAT TATATCCTTG 
264501 CTAACAATGT GCGCTTTGGA ATCGAACTGC CTCAATTCAT ATTCCAGCTT 
264551 GCCGCTTTTA GCAATATTAT CTGAGGCAAG TTTTCAAAAT CATCTGAACC 
264601 TTCTTTTCTC ATCTGTTAGA TAGGATAATA ATGTTCTCTA TGTTACAGTT 
264651 ATAGCAATTA AATGACATGA TACATGCAAA GTATTTTTAA AAATAATAAA 
264701 TAATAGCCAT CTTCATTATT ATTACTACTT TTTTTGAAGG AGTCATTGAT 
264751 AAAAGCAAAT GCACTATTGA AGAGATCATT AATAAAAGTC AATACAATGT 
264801 TGAAGGTCAT CTTGTCCAGT TTCACAATCA TCTCTGAGTT ATTGGGAAAT 
264851 GTGTTATAAT GAATAGATCA ACAAA7VATAT AACTACAGTA AGGCAAAATA 
264901 ATATGAAATG AATTACAAAG ACTAAATCAC TCTGATAAGA TTGATTGTCT 
264951 TCTGACTGAT ATTTTACTAT ATTCCTAACA TGGGTCTGAG AAAGAGGAGA 
265001 ATAATATAGG CAAATTGCTA CAATCATCAA TTAAGTAAAT TTCTAAAAAT 
265051 GTTAAGGAAT GTATAACACA AAACGAGCTA AAGAATTGTT TGCAGAATGA 
265101 GTGTTGATGA TATTGGGTTT GTCATGGTTG CCGTTCCCTA TATCATGCTG 
265151 TTCTTACCGT TGCTGTGACC TTCTTCTCTT GTTCAGCAGG ACATTGACAT 
265201 GCAGCATGTA ACATGTACTA CTAGAAGATG TAACCAGTGA TCTGGCTCTT 
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265251 ATGCCAGGAC AGATTTTGTT CTAGGGTCCT AAATTGGTAC TACCTTCATC 
265301 ACTGATTATA TCAGTCAGTG GTCCATGCTA TTAATACAAG CAGCATGACA 
265351 ATCTTAGGGG TAAGAAACAT GTTACAGTTT AGAAAGATCT GTACAAGGTC 
265401 TACTTACAGA GACTGGTGTC AAAAATTTTT ATGAGACTTG GGAAAATCAT 
265451 ATATATTTTT TCCTTTCCTC TTCCCTGGGT TTTCCTTATT CCTTAATACA 
265501 TGCAAAGCTG ATGGGGTCAG TTTGGGATCT CGAAGGGAAT TATCCCCTCT 
265551 GTATTACAAG AGGCCTCATG GTGAGCTATG CTTTGTTCAA AAAGGATGAA 
265601 AGCAATTTAA ATTAAATAAC AATGGATAAT CATCTAGTTT ATTGTAAGGT 
265651 ATTGCCACTT GAATACTACA GTTTTACTAT GGAATCTTTG TTTCAATAGT 
265701 TGTTTTTTAT TTTATATGTG ACTTACTACA AGAACTTACT CTGTTATTTT 
265751 AATGTAGCTA TGTTTTTATT TTTTCTTGAG TAGTAGTAAA GAGGGTTATG 
265801 TCCTGTTATC ATGGTGACTT ACCCCTGCCA TCCATTCCTG GGGAACTTGC 
265851 TTGGCTGCTA TGGTCCGTGT GGTGACAGGT TATTCCTTGG TAGACATGAA 
265901 GTGCAGAGGG TACAGTGACA TTAAAGCCCT TTTCATTACC TGAAATCTTC 
265951 TTACAAACCA TCTACCAAGG TATGTTAGAC CAAGAAAAAT GTCTTCTATT 
266001 TTCTTCTACA GATTATAACC AGCAATACGC TGAGATGATA ATGGGGCTGG 
266051 ATATGAAAAT AAATAGAGTA CTTTACTTTC TGAAAATCAT TCCATAGCAA 
266101 CCGAATGCAT TTGGGATGAT ATCTATCTAT CTATCTATCT ATCTATCTAT 
266151 CTATCTATCT ATCTATCTAT CTATTCTAAG AGCATATGTT TGTGTTCAAA 
266201 TGTCTGCTTT TCTAAAAAAG TATTTTTGGA AACTGACATA TAAAATGATC 
266251 AGTACTAGTA ATGCTCAATC CTTTCTTGTG TTTTGGTATA GTACTGACTT 
266301 TTTATAATCT ATAGACATTT TGCATATAAT TTCAGATTAG CTGATATAGC 
266351 TAGGGTTTAA TTACAATTAC TTCCTAGAGA AAAAGCTCCT TTGTCTGAAC 
266401 GGTGATAATT CTGAGTATAT GATTTAGATG TGCCTATTAT AACTGGGTGA 
266451 TTGAAAAAAA CTACTGAGTA AATTCACAAT TTAAACAGGA TGACACAAGG 
• 266501 TGTACATTTA ACTTATTTAT GACACAAATT TTTTCAAGCC TCCATTTTAT 
2 66551 GGAAAATATA GAATACTGTG TGTTACTTTT ATTTATTCAT TATAGGACTA 
266601 TTTTCTTTTG GGTCCTTCTT TACCTACTGT AAGCTTTGTA GCTTAGTTTG 
266651 CGTTACTTTC TGTATATAAA CTAATAAAGA GGCATATTTT TCAATAGACA 
266701 TGAGCCTTTC ACTAATTCAG AATGCCTATT TCTTTCATTA GGACAAATTA 
266751 GAGATTATGT TTTTCTCTGA ACAAAATGCA TTTTTATGTC AATTAAATGA 
266801 GAGTTTTTGA TGAAATGTTT CCTAACAGAG AGTAAGTAAT TGTGATATAA 
266851 TTGTTTGTAG ATGAAAAATA ACTGCCACAT TCATGGCAAA TGTGAGTTTA 
266901 ATGCAGATGA TTTTCATGAC TATGATGTGT TTTATGCGGC TTTTTATACC 
266951 CACTGTTTTG GAAGCATTCA TAGTACCAGT TAAGGCAAAA CCAGCTTAAT 
267001 GTTTGTTTTA GTGCTTGGCA ACAAATACAG TCTATAAAAT ATGTAGTAGT 
267051 TAATATTAAA AGCACATGCC AGATTGTAAC ATAAAGAGTT TATTCAATCT 
267101 CTTTAAAGTC ATCAATTTGA AGCTTCATCT GAATGAGTTA GCATTTGAGT 
267151 TCCATATGAT TTTCAGCAGG TGAGATCTTG TGAAGTTAGG AATTGGCTAC 
267201 CCCTGCGAGC CCCCATTGGC ACAGCCCCCA GTTTCTTAAT GAGCTTGCCT 
267251 CATTGCTGAG TGAGCAAAAC ACTCTTAACA ATGTTCCTTT TGAGCCATCT 
267301 TTGTTGCAGG GGGAAATAAA GAGAGGCGCC AAGCCTGCAA ATTCTTTCCT 
267351 GATCCCATTG GTTCACCAGG AGAGAACAAT TTTGTTTTCC ATGGTTGCGG 
267401 TCTTTCAAGG TTGCCTTTAC TTAGGATTTG ATTACCTGTC TCCCTCAATC 
267451 AATTAGCCTC TCCCACACAC TCTCTTGCCT CCAATAATGA TCAACCTCCT 
267501 TTTCTAAGAC CAGCCTCCTC TTTATCATAT TGCATTAAAC ACTTGATGAA 
267551 AGAAGACCTA GAAGGCTTTA AGAAAGACAG GGTAATATTT TGCTGGTGTA 
267601 GAGGGCCTGC TCTTCGGAGA CAAATACTTG CTTTTCATTT TGATCAGCCG 
267651 TCTGCACTTA AAAGCCTCCT AGCCAGGGAC ATGTTAATTA AACAGTCTGG 
267701 GGCTCAGTGC TGCCATCCGT AAATTGGAGG ATTAGAGAAA TAATATACAC 
267751 AAGTACCTGG TACATACCAA ATATGATAGA AGATATCTTC AACCTTTAGC 
267801 TATAAGAAGA ATGGAAAATG AAAAAGAGAC AGGTGGAGAG AGGGAGGGAG 
267851 ATGAGAGCTG CGTGGAGCTG GTATGCTGGT TAGGAGGTAG GAAATAATGA 
267901 GGCCTCGACC ATCTAAGATA CAGAAGTCTC CCTGCTGAAT CGAAAGGAAG 
267951 TGCTACAATA TTCTTTCAAA TTCATCGCAG TTGAGAGAAA ACTGGAGCAG 
268001 AGTGTGATAT CAAGAAAGGG CTTCACCAGG ATGATCAATG CAAATGGATA 
268051 TGGAGCTTTC TTGGAATGAA ACATTATCCC GTAACACCAG GTAATTTTCT 
268101 TTATTAAAAA ATACACATTA AAGTCTTAGA ATAGTTTGAT ATTTACCGAC 
268151 AAGTTGCGAA GATTTTACAG AGCCGTATAT ACCACCCTAT TGCTAATATT 
2 68201 TTACACTACT ATAGTACATT TGTCACAATG AGGAATAAAC GTTGTGGCAC 
268251 ATTACTATTA ACTAAACTCC ATACTTTATT TAACTTTATT ACTTTTCCCC 
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268301 TAACCTCCTG TTCCAGGATC 
268351 TCTCCTTAGC CTTCTCTGAA 
268401 CAACCTTGGC AGTTTTGAGA 
268451 CTCAAGTCGA TTTTGTCTGA 
268501 GGCTTTGGGG AGGAAAACTT 
268551 TAACAAGGGT CTATGCTATC 
268601 AGCTCACTTG GCTAAGACAG 
268651 TTCCCCCTGC TTCGCACACC 
268701 CTACCCTTAC GTGGTGGGGA 
268751 GTATCAACAT AAATTATTTA 
268801 CTCTTCTCCC ACCCCCTGTT 
268851 ACTTAGCTCC TGCATACTTT 
268901 GCATTTCCTT ACTTTCTGAT 
268951 ATTCCCTGCC TTGGCCCTAG 
269001 CTTTCATTGG AGAATGGTTT 
269051 TGGTCACTAC AGGGGATTCA 
269101 TAGGAAATCT ATATGTGCAT 
269151 TTATTTCAAT AGCCATCCAT 
269201 TTCACAATGA TGTTTCCTAC 
269251 TCTTCCTCCT TTGCTTGTCT 
269301 GGATCTAACT CTTATTGGTT 
269351 ATACATGAAG AGTGGCTTCA 
269401 AGCTTTACCA ACTAGAGGAC 
269451 ATCTTCTACT TTCCAATAAT 
269501 ACCCACCCTC TTCAGTGAAG 
269551 CTTTTTGTCG AAGTGTGCAT 
269601 ATTTTAAATT TGCCTATGTT 
269651 GAATTTTGAC AAATACACAG 
269701 GGAATACTCT CGCCACCCTA 
269751 TGTCTCCCTC CCCAAACCCC 
269801 TATTTCTGCC TTTTCCAGAA 
269851 GCCTTTTCAG ATAGGCTTTT 
269901 TCCACATTGT AGCATGGATT 
269951 TATTATCTGT GGATGTGTCC 
270001 TCGGTTGCTT TCAGTTTTGG 
270051 CAGGTGCAGG TTTTTGTGTG 
270101 CTAAAATTGC AACTTATGGG 
270151 GTGGCTATAC CATTTTGCAT 
270201 TGGATATTGT CTTTTTATTT 
270251 GTAATGGTAT CTTCTTGTTT 
270301 CTGAGTATCT CTTTATATAC 
270351 GGGAGTCTGT TCAGTTATTT 
270401 TTATTGTTGA TTTGTAAGAG 
270451 CGTTTATTAG ATATGGTTGT 
270501 TCTTTTCATT TTCTTAACAG 
270551 TTTATTGTAT TTCTATCCTC 
270601 CAGAGCTGTT CCACTTACTG 
270651 CTAAATAATT GTCCATTGGC 
270701 GCTACTTGTG AGTCTGAGGC 
270751 ATTGACGTGA GCCGAGATCG 
270801 TGAGACTCCA TCTCAAAAAA 
270851 TATATTGTAT TGAAAAGAGC 
270901 GTTAGTTTTC CCCGTTCTCC 
276951 TCACACAGAA GTCTCTAACC 
271001 AGGGGAACTA TTGCGTTCTG 
271051 TCAGAGACCT CGGTTCAGAT 
271101 CGTAGAAAAG TAACTTAAGT 
271151 ATGAAGATAA TATCGTCCAC 
271201 CGAAGTGTTT AGCACAGGGC 
271251 GTGCTTACTT GCTTTTCCCC. 
2713 01 GATTTTTCCC AGTGGAACTC 



CCCTCTAACA CATGACATTT AGTCATTATT 
CTGTGACAAT TTTTCAGACT TTCTTTTCGA 
AGTACTGGTC CCTGATTTTG TAGGATGTGA 
TGTTTTTCTC ATAATTAGAC TGGGGTTATG 
CAGAAGGGAT ATACTATTCA ATTCATATCC 
AACGTGACTT ATCACTGATG CTGTTAACCT 
TGTTTCAGGT GTCTCCACTA TAAAACGCTC 
ATACTCTCTG GCAGCAATAG TTGGCACCGC 
ATTATTCTCT ATCTCCTTGA GACAGGGCAA 
AAATTCTTCT ATTCGGATAT CTGTCTCTTC 
GACTTTATTC AATCACATAT TTATTCAATC 
TAGCCTACAC CTATCATTTT TGTTTTGTAA 
ACCATCATAA GCTCCAGGAT GATCTTGTAT 
AATCAGCAAT TTCCCCGAGG ACTCTGGTTT 
TTGAAACCAA GATCTTGGTG CTGGGTGTGC 
TCGCTTCTAG GCCCTCTCAG TGGACAGAGC 
ACTAACCCAT GTATGTATAC ATATCTATAA 
CTGTGTCCAT ATTAAATATT TAAGCATGAG 
TCTAGTACTG TACCACATGG ATCATTCTAG 
GTAACCTCCC ATTTGAAAAC TGAGAAAATT 
CATTTACTTA CTTTTTCCAT CCAGGTATAC 
GGACTGGTAA GTTATGTTCC TTTGAAAAAT 
AGTGCTCATG TGCAGTTCCT TTGCCCTTTT 
TTCCAAAGTT ACTTAGGTCA ACAGTCTTTA 
TTATGTTATA TACTTAGCAT ACAGTTACAT 
TTCATCCCAG GATCTCTTAA TCTACTAGTT 
AAATTTATTC TCTGCTCTGT AAAGTTCTAT 
TGTCATATAT TCATCACTAT GGTACTGTGT 
AAAGAATCTC CTGTGCTTCA CCTAGTCAAC 
TTCAAATCTC TGATCAGTTT TCTGTCCATG 
TGTCGTGTGA ATAAGATCAT AGATTATGTT 
TTTCATTTAG TAATATGCAT TTAAGATTCA 
AATAGCTCAT TCTACTTTAT TGCTAAGTAA 
GTTTGTTTAT CCATTCACCT TCTGGAGAAC 
GTGATTATAA ATAAAACTTC TAGAAACATT 
GACATAAATT CAAGTCAATT GGGTAAATAC 
GGATATGGTA AGTCTAAACT GTCTTTCGAA 
TCTGACCAGC AGTGTGCTTC ACCAGCAGTA 
TTTTAAGATT TCAGTCATTC TAATAGGCAC 
TAATTTGCAG TTCTCTAATG ACAAATAATG 
TTATTTGCTA CCTGTATATC TTCTTTGGTG 
TGCCCATTTT TAACTGGGTT ATTTATTTTC 
GTTTTCTAAA GAAATATTTC AGATACATAT 
TGAAAATATT TTCTCCTAGT CTGTGGCTTA 
TGTCCTTCAC AGAACAAAGG TTTTTAATTT 
TTTTTGTCCA TGGTTATATC CTCTACTATA 
TTAGAGCAAA TCGTATTAAT TTCTGTTGCC 
CGAGAGCTGT GGCTCATGCC TGTAGTCCCA 
GGGAGAATCG CTTGAACCCA GGAGGCAGAG 
TGCCACTGCA CTCCACCCTG GATGACAGAG 
ATAATAAATA ATAAATAAAT AATTATCCAT 
TGCTATATAT GGCATGATAC CACTCATTAT 
ATAAAATGAC CACTGTGTCT GTCAGAGTTG 
ATCTTAACAC CTGAGCATGC CCAGGACTGG 
TATAGAGTAC AACAATCATG GGCGTTGGAC 
TTTGCTCTTA ACATTTCCTG GTTGTGTGAA 
TTTCTGAACT TTGGCCTTCT CACCTATAAA 
GTGTCTTAGG TGCTATTAGA AGGACACATA 
CTGACGGAGT GTAGCACTCA GTAAATGTTA 
CTTGTGCTCC ACATTTTCTT TCAGCCTCCT 
TTTACCTTGC ATCCATCATT TCTATTGCAA 
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271351 TGCATTTCTA TGCCTTTTCG AGGAGCTTTT CTATTCCATC TACACTGAAA 
271401 ATGGCTCATC CTATTGATAC AATTTCCCTG CTCCCAGCCA ATGGCTGCTG 
271451 CAGCACTTCC TGATATCTAG ATCACAAAGT CCCTGAACAG GTTAGGGTAG 
271501 CACATGTTAA AAAAACAATA TTCCTGCAAG TTCACCTTTC TCTTACTTCT 
271551 CATTTCCAAC CTAAGATCTT TTCAATGAGC TTACTCTGAT TGAATATAAA 
271601 AACAATTTTT AAATAATTCA ACCTAAAATA ATAGAAGTAT TTTTTGCTTT 
271651 TATTATTTGT AATTAAGTTA TAATAATTGT ACACATTTAT GGAGCACAGT 
271701 GTGATATTCT GATACATGTA TGCAATGTGT AATGATCAAA TCAGAGTAAT 
271751 TAGCGTCTCC ATCACTTCAA ACATTTGCCA TTTCTTTGAG TTGGGAACAT 
271801 TCAAAATCTG TTTTTCTAGG CATTTGAAAA CATACAAGAA ATGTTGTTAA 
271851 TTATAGTCAC CTACAGTGGT ATAGAATGTT AAAATGTATT CCTTTCATCT 
271901 AGCTGTGCGT TGTGTTGTTA ACCAACCTTT GGCTGTGCTT CCCCCCACCC 
271951 CTACCTGCCT CTGGAATCAC TATTCCACTC TCTACTTCTG TGAGATAAAC 
272001 TTGGAAAGAA ATAATACTAT TAGGATGATA ATTTCCCACA AATTCATCTA 
272051 TAAATTTATT GCAATTCTAA TTAAAAATCC CATATGATTT ATTATGGACC 
272101 TGTCATACTA AAACTTGTTT GGGAGAGAAT ATGCTGGAGA ATTGCCAAGA 
272151 AATGTATCAG AACATTCTAT AAAGCCACAA TAACTAAGGC AATCTGATAT 
272201 TGACAATGTG TATTTTATCA ATGGAAAATA ATAACATAAA CATATATGTA 
272251 TTTATGTGGT TTGTTCATAT ATATGTATTG TGTATGCATA GTTGTGTGTA 
272301 TATATAGTAT TTATATGCAT ATATTTAAGC ATAACAATTC AAATCATCAT 
272351 GGAAAGCATG GACCATAGCA AAGGGTCTTT ATTACACAAT AGTTGTGAGC 
272401 ACTGGCCCTG GCACCAGACT GCTTGCTTTG AATCTCGACT CTGTTTCCGG 
272451 GATTTGAATC TGTCAGAGCT ATGTGACCTT AGGCAGGTTG TTTAATTTCT 
272501 TGGAGCATGG CCTCAGCTTT CTTTTGTGCT GAATAAGAAA ATAATAGTAC 
272551 CTAACTCCAA AGTACTTAGA AGGGTTAAAT GAATTAATGT AGGTAAGGAG 
272601- CATAGAATAG GTGCTGGCAC TAAGCACCAG CGAATGTGAA CCATTCTTAT 
272651 TATTATCATC CATAATATAG AACGTAAGCA ATTGTCTGTT CGCTTTGAAA 
272701 AGTAACCCTT GGCAGGGCGC GATGGCTCAT GCCTGTAATC CCAGCACTTT 
272751 GGGAGGCTGA GGTGGGTGGG TCGCTTGAGG TCAAGAGTTC AAGACCAACA 
272801 TGGCCAACAT GATGAAACTG CATCTCTACT AAAAATATAG AAAACTATTA 
272851 GCTGGGCATG GTGGCAGATG CCTATAGTCC CAGCTACTCC GGAAGCACAA 
272901 AAATTGCTTG AACCCGGAAG GCGGAAGTCA CAGTGAGCAA GATAGAGCCA 
272951 CTGCCTCCCA GGGCGAGAGA GGAAGACTCC ATATCCAAAA AAAAAAAAAA 
273001 AAGGAAACCT TTAGTGACTA GTTTAGAGCC CTACACTGAC AGGTGCCTAT 
273051 CCAGTTGCTG ATTTCTTGTG ATGTGGATTT ATTTTTACTT CATAAAATAA 
273101 AGAGGCAACC TTACACTTTG TTATTAAAAT AGTAAGTGGT AAGGAACCCT 
273151 TTAAGGGATA TGGTTTTAAC ATATTATTCA AAATAGGAAT TGCAAACTCA 
273201 ATTGCTTAGA GAGGTCAGCA AGTAAATCAA TGTACTAGAC TGGGTAGAGC 
273251 GTGTGGTGAA ATGAAGGCCT TCTAGACGCT TAAGAAGACA ACCAGTTTTC 
273301 AACACCACCC AATTATTGCC AGGCAGAAAC ATGCGTGCAC ATTTTTTAAG 
273351 ATTTTTATTT TCATGCAATA TTTTCTGGCT TCTAAATGTT GACAACAAAT 
273401 CCAAAGTTTG . AAGCATTGTG TGGATCAAAC AAAATAGATG TCTAGAGATC 
273451 AGAGTTGGTT CATATTGTTT GTTGCAATCT GAGACTTAGA ACCTCCTTTA 
273501 GGTTACACTA CCCAGTCTTT CTCCCAACCT GTGTACCCCA ATTGTACCTT 
273551 GCACAGTGCT GCCCAGGTGA TAAGGAATCA ATGCTTGTTT TTTGTTGAAT 
273601 TAACTTGATT TTAATTCTGA ATCTTTCTGA TTCCAATCAA TGCTTTGAAT 
273651 TAATAATACT AGCTCCTGGC CAGGCACGGC GGCTCACGCT TGTAATCCCA 
273701 GCACTTTAGG AGGTTTAGGT GGGTGGATAG CTTGAGCCCA GAATTTGAGA 
273751 CCAGCCTTGG CAACACAGAG AGACCCCATC TCTACAAAAC AAAACAAAAC 
273801 AAAACACCAC AAAAAAACAT TAGCCAGGCA TGGTGGTGAA GTGCTGTAGT 
273851 CCCAGCTACT TGAGACGCTG AGGTGGGAGA ATCACATGAG CCCGGGAAGG 
273901 TTGAGGTAGC AGTGAGCTGT GATTGCAGCC ACTGCACTTC AGTTGGTGCC 
273951 ACAGAGTGAT ACACCATCTT AGTAATAATA ATAATGGTAA TAGTAATAAA 
274001 AGTGATAGCT CCCTCTGTTA GCCATTTATT TTAAATATCA GGTGCTGGTA 
274051 GGGTCTTTAC ACATATCCCT CTGATCATTA CAATACTCCG TAAAGGTTGC 
274101 TTTTCTCACT TTATAGATGG GAAAACTAAG AGGTTCAGTT ACCTTTCTAA 
274151 GGTCCAAGGC TAGTAGAATT AGTGATATGA CATTGTCTAG GTCTGCCTTT 
274201 TTGAGGGAGT GAGATGTCAA AAGTAGTAGC MGAGGTCCA GATCCCTCTA 
274251 GGGTGTAGTG TTTCCTGGCA TCAGCTAACT GGGAATACTG ATCACGTCTT 
274301 GTCCTGGAAA CACATTTCAT TGGATGACCT TGTCAGGCTT ACAATGCAAA 
274351 AAAAAAAAAA AAAAAAMAA ATTACAGGAT AAAAGATGAA AGATGTCTTC 
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274401 CACTTACAAA TGGCAAGCGA TAGCTATCCA AAGGTTCATT TGTAAATCAG 
274451 TTGTTTGAAA CTCGGAATTC ATTTTCCTGC AGAAACAGTT ACAAATGGTG 
274501 GTTATGTTTC CCCTGAGGCT CATCCACAAA AGACCATTTA GCCTTAATGT 
274551 ATCTGAAATA GTGTACATTT GCTGTGAAAA TGTGTAGAAA ATAGAACTCT 
274601 TGTAATTCTA GTAA^TAAAT AAAAGAAAAA GTAAAATTGA ATGAGAACTT 
274651 TAAACTGCAT CTTGAKTGCT GCTTCGAAAG AAAAGCAGAT TTACCCAGAA 
274701 GAGGATGCAG AACTTTGAAA ATAATTTTGT GGGGATTTTA AAGGGGGCTT 
274751 CTGAATACTT TCACAGCACT TGAGAAGTTA ATGAAATCAT ATCATTGAAA 
274801 TATCGCATTT CATTTCGTGA TTTCTAGATT TTTAACATAA TCTTTCCATG 
274851 ATTAATGCTA TTTCTAGTGT TCTCTTTAGA ACAAGAAGAA GAAGGGAAAA 
274901 AAGGGTGGAA AAGGAAAAGA CAAATGAAAA TGGGCTTCCT AATGATGCAG 
274951 AAGTTGACAG AGATAGCATG TGAAAGAAAA TTAAAGATCA TGTTTGTTTT 
275001 TCGATAGGCA TGCAAGAAGC AAAATTAGCC ATGAA^TATG AACAGTGGAA 
275051 GGAGGAAAAC ATAGATTCTG ATTTCACTTG AGAGAGACAA AACAATGCTG 
275101 GTGGCAGAGA TCTTTTCTCT GAGAAATTCC AGAAACTGTT ACTGTTTTAT 
275151 TCCATCATCT TAGGGCATCA TACTTCATGG CAAATGTAGG ATGGGTACAG 
275201 GATTGAGATC AAGAGTGCAA AAGTTGCTCA AATGGACTAA ATGTTGAGAA 
275251 TACTAAGTAC TTATTAAACA AGAGAGGATT TGTTCTCAAA CAAAAGTAAT 
275301 TATCATTTTT TATTGTCTAG GTTTTATCAC TCATCTTGTT TAACCTTCAG 
275351 CCAGGAGAAT GAAAGACAGA AGTTTTTAAA AAATATATAT ATATTGTGGG 
275401 CGGGGTGTGG TGGCTCATGC CTGTAATCCC AGCATTTTGG GAGGTGGAGG 
275451 CAGGCAGATA ACCATACAGG AGTTTCAGAC CAGGTTGGCC AACATGGCAA 
275501 AACCCCGTGT CTACTAAAAA TACAAAAATT AGCTGGGAGT GGTGGTGCAT 
275551 GCCTGTAATC CTAGCTAGTC AGGAGGCTGA GGCAGGAGAA TGCCTTGAAC 
275601 CCAGGAGGCC AAGGTTGCAG TGAGCTGAGA TTGCACCACT GCACTCCAGC 
275651 CTGGGTGACA GAGTGAGACT CCATCTCAAA AAAAACAAAA ATACCAAATA 
275701 TATGTATATT GAATGGCCAA ATGGTTGAAC AGGAAATTGA GGAAGAATGA 
275751 ACAACCTCCC AGACACTTCC ACAGTGAGTT TGCAAGGAGC CTAAAAGCTA 
275801 GCTGCACAGA GCAGAGGCCC TTCCTGCCAA ATGTAACCTT CTTTAGAAAT 
275851 AGAGAAGAGA AAGACAGTTG GGGGACAGGG GTTATTATGG ATATAGGACA 
275901 AAAACAAACC AGGGTAAAGA AACCAAAAAC CAACAAGGCC AAACCAATGG 
275951 ACACCTTGAG GTGATGTGGG CAGACACAGA GTGTTTCCCT TTGTTGTTCT 
276001 TTGGAGAGTA AGATAGGCTT CCACCACGGA CCCTCTTGTC AATAAATCTT 
276051 CCTGGGTTAA GTGGTTAGTT AGACTAAATT GCTCTTCCAC CAATTTGAGC 
276101 TGACAGCAGC ACAGAACCGT CAAAACCATT GCTGTGTTGT CTCCATAGCC 
276151 AAAGAGTGTT GTTGATTTAT TTCTGTTAGA GGAAAATTTT AACTTGCCTA 
276201 CCTTTATTTT TTTTAATCAT TTGAATTTAA GACACTAGTT CCCAGTAGGA 
276251 TTCACTATGA CAATTTTCAG TGATGAGAAA AAGCATCAAG GGTTTCTGGT 
276301 CTCTGAAGAA TTGTGTTCTT TTTGGCCAAT AAAATTAAAA TGAAATAGAT 
276351 TTAAATTGTA TTAATTCCTA ATTTAAAGCT GAAGTTTCAA GTCACCTTCC 
276401 ATTACTATTG TATAGGATTT CTACATAAAC CTATTGGGAC CTGAGACAGG 
276451 CTAAGCCTTA CTAAATTTCC TGGCTACAGG GATATTTCTT TTGGGGATTG 
276501 GAGATGAGAA GGGGTAATAT CGACAATCTC ATTTTCTCCT TAATTTTTCC 
276551 TTGATTGAAC TGGTGTCCAT TTGCTTAACT TATTCACTGT GTGTTATAGA 
276601 AATTTACAAG TTACTTTGCA AGTGTAGCTA TATTCATGTA TGTGCTATCT 
276651 TTTCTTTCTT CAATTAAATT GTGAGCTCTT TTAAGTAAGG AATTATATTT 
276701 TATCTTCTCC CACAATGACC AAAGCCAGAG TATCTTACTT AAGACACTTG 
276751 AAAGTGGAAT ACAAAAAGTA TTTTCATGCA GTCCATACTG CCTTGTTTGC 
276801 TAATTGCTGT ACACATGCAG ACATCATCTC ACCAACTATG CCCTAAGCAC 
276851 CTCATTGGTQ GAGTCTTGTT TTGAGGTAGT GGTTGTAACA GTTTGGGTCT 
276901 TTTATTAGTG CATTGTAGGG ACAGAGTAGC CTCTCAACAA ATAGTTGTTT 
276951 GTTAAATAGT CAAGTTTGGA GGGCAGTAGA TGAGACAAAG TAGAAAATGA 
277001 GAGCTTTTGC ATTAAACTAT GACAACTAAA ATCTATATTT AAAGTTTACA 
277051 ACCTAACTGT ATGTTCTTTT TAGTACATAC CACATCATTA AAACTTGATA 
277101 AATAATGTTT CCTCAATACT GTTACTTAAT AAGTAATAAA AATATGATGC 
277151 TGGCAATAAA AGCTGTTATA TTGTTACTGC CTTACCTCCT GGTTGGTGAG 
277201 CAAATGCTAG AAAGAAAAAA TGGGAAATAG TTTAAATTTG AGAGTTTTGG 
277251 ATCATTTTCT TGGCAAAGTC AAGAGACCAT TTGCTTCTTT TAAGCAGTCT 
277301 AAAAGGTGTC AAAGGATTAG AAGCAAAGGA AATATCAGCA TTGCTGTGCT 
277351 GTTAGACAGA TTCCTAGAGA ATCAAATAGG GATGGTAACA AGATTGAGCA 
277401 ACCCATCTCC TTAGAATTTT GCTTCCCTTT ACACCGACAA CCTCCCTATT 
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277451 TTATTTTTAA TTACCCATAG AAATAAACAG GAATGCTAAA ATATGTTTTG 
277501 AAGATAAATT CTATAATCAT TTCAAGTTTG ATTGTTGATA AATATGATAT 
277551 GGATGAATAT AAAGAGTAGA TTTTCAATAT GAAAAAAGAG TCTCCTAAAA 
277601 TGACGTGGGT TGTAMTTGG AGATGAATTT CTCATTAGTT TTAGTGCCTT 
277651 TTGTGTCTCC TATTGTCAAC TTAAATGGTC TCACACAACA CATCTGTTGC 
277701 TAGGCTCATG AGATGGAGCA GAGTTCAACA GGTACATTTC AAGGCCAGGA 
277751 AGGCTGATAA ACTGAAGGTT CAGTAGGTAA GAGAAGCACT AGAAACATAA 
277801 TGATCAGTGA AATTTAGTAC TATCAAGAGG CTAGAAATCC GGTTAGAAAA 
277851 CAAACCACAG GATTGGTAGG ATCTGAAAGT TGTCACAGCT TCAACAGTCA 
277901 GTATTGCCAA AGAAAATAAA ATGGGGCACT TTTTAAAAGC AGTAAACTTC 
277951 GATTTCACTC AGGAATAACT ATTGCAATGT GGGAGAGACC TCAGCATAAC 
278001 AGCTCAGTCC AACGAAAGGC AGGAGACTGT TTAAATGCTG GGGTGTACCC 
278051 AAAGAAAGGC ATTAAGGGAG AGATTGGTCC ATGTGATTAG GCTATCTGGG 
278101 TTTGCTAACT AGTGTTCACC TAGAGGAGCA AACTTCTCAT ATCTTTTTGA 
278151 CAGAAATTAG TGGTGCAAGT TGGAGCAAGG CATCCACTGA AGTTAGGTCC 
278201 TTACTGTCCT ACAGGGACTG GGAATGAGAA CTATTGCCTT GAATGTTTGT 
278251 ATTTCAAGGA GATGGCTCTC ATGTCCTTGA GGAGCCACTT CTGGATGGTA 
278301 AAAGATTTAT TTCAATGGGG GCAGAGAAAG GATTTATAAT TGCAAGCTTT 
278351 CTAAAATAAG TGCTTTAAGA GGGATTCAAG GGCCTATTGG CCTATCACCA 
278401 GGTTTTGGCT GGAACAAACA GTAAATTCTC CTGGCAGCAT TGAGCTTTCT 
278451 CAGAAAGTCA TTTTAAGGGG GGTTAGTGCG AGAAAGACTG AAAAAGAAGT 
278501 AGTCCCAGAA CTGTTCTCAC CTTCGATGCA GGCTGCGTCT CAGCCTAATA 
278551 TTTGTCCTGA TATTATTATC TCAGGGGACT CACAGTCAGG AAGAGTTGGT 
278601 AAAGAGGTTG TCAGACTTTA ATCTCCCCCA ACTTCTAGTA ACTTACCAGA 
278651 AGATGCCACA AAGTTTAGTG GTGATGGAGA CCCCTCTAGC TAGAAGAAAC 
278701 TTTTGGTATT TGTTTTTATT CTATTTGTGA TGTGTTCTAG CATTGTGAAA 
278751 TGAGGCAAAA AGAAATGCTG AAGCTGGTTC ACAGAAGTAA GTTGGTGTAA 
278801 AATCCTATGT GGTCCAGGTT TATCTCAACA ATACAATCTC CTCATGCCGT 
278851 TCTACCTTAG CCAATCTCCT CCCATCACCA TAGCAACCAC ATCTTATAAT 
278901 ATGTGATTCC AACTTAGAGT GCCTCATCAC TGCCCTTTGT ACATGCAGAT 
278951 TGGGAAAGAT GGAAGAGACT GTTCTTGGGA ATAGCACACA GCAATCTGGG 
279001 CCAAATCTGG GTGGTGTGTG CCCCACTATT TGCCAAGCTT ATTATCATTT 
279051 TTTTTCTCCT TTGGCCTTAT TCAACTAGAA GTCCTCTGCA TCTACTAGAA 
279101 TTCACTACAT ATATAGCCCC AATCTTCTTT CTTTTATCTC TGGCAGTGAA 
279151 CAATAGAAGG CATTTAAGAG CATGAAAAAA TAAATAATAA GTGCAGTATG 
279201 AGGTGTATGG TAGCCATTGC TATTAAAATT ACTCATTCAC TTTTCTATGA 
279251 TGAATGAAAC TCTAGTGATA AATAATAATG GTGCCCCTGA AAAAATTAGG 
279301 TTTGCAATTA GGACAATTTC TGCCTTGTAG ACAAACTGCC TAATATTATG 
279351 CTTCAAGACA GTTATTTACC ATAGTTATAA CCCAATGTGT CCTGCTGAAA 
279401 TGGCTTGTTT TTCCTACAAT GTTAAAGATT TAATTTTTTG AAGATCAGTT 
279451 TGTTTGAAAC AGCAGCTGTA CTTTTAGGAA CATAACCTGA AGAAATGTAG 
279501 ATGTAGATGT AGGTTAAAGT GAGTTCATCA CAGGCAATCG AAGATGTAGT 
279551 TTAAAGATGT AGAATTGTTG ATAGTAGTAA AAATTGTGAA CAGTCCCAGA 
279601 TATTGAACCT TAGAGGACTC ATGAAGTTAC ATAAGGTAAA CACGTGCAAT 
279651 AGATACAATG TAACCTTTAA ATTGATGGTA ACTGTATAAT ATTTTTGAAA 
279701 TGGAAAGATT TTTATGATAC ATTTTTAGTG AAAGAAGTAG CTTGCAAGTA 
279751 ATGTAAACAC TGACATTTAG TTATTAAAAG ATACATATAC ATATTTGTAT 
279801 ATATTTATAC TTACATACAT ACATACATAC ATACACACAC TTATATACAT 
.279851 TTGGCTAAAC AAAAGCTTAA AAGAATGATT TGTAATTTTG TGAATGCTTT 
279901 TGTTATTTTT CGTTGTTGTT TTTCTGAGAT AGAGTCTCTC TCTGTCGCCC 
279951 AGGCTGGAGT GCAGTGGCAC AATCTCGGCT CACTGCAACC TCCGCCTCCC 
280001 AGGTTCAAGC FATTCTCCCG CCTCAGCCTC CCGAGTAGCT GGGATTACAG 
280051 GCGTGCACCA CCAGGCCCGG CTAATATTTT ATATCTTCAG TAGAGATAGT 
280101 .TTGCACCATG TTAGACAGTC TGGTCTCAAA CTCCTTACCT AGTGATCCAC 
ofno^ CCGCCTT ^GC CTCCCAAAGT GCTGGGACTG TAGGCATGAG CCATTGCGCC 
280201 CAGCCGCTTT TGCTATTTGA TAATTATTTT CCTAATAAAA TCCTATACTA 
280251 ATGGTGTAAC AACCAAAATA AAAAAGAATG AGGAACGTAA AATTGCAGTT 
280301 TGCTTTTCTT ATTCATTATG TCAACAAAAC CTTAGCCAAC CACACTGGCC 
™^ CTTCTTCTTT TTCCTGGCCA AGTCAACCCT ATTCCTATGA ATGATAACTG 
280401 GAGAAAATGT GCCCAGTGTG GGGAACAAGG ATATGTTGGT GTATACCTAC 
280451 CATGGTATCT TTCTCAGGGG ACTGAGTAAA TACAGGGCTC TGTGCAACAC 
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280501 AGGAAGAATT CTTAAGGGGC ACCTCAAATT CCAAAATTCA GATCCTTTAA 
280551 AGAGAGAGGG AAAATCAGTA CTGCCCTCTG ACTTGGAGTG AGAATATACC 
280601 CTGCTGTGAA GGGAAGAAGG TCATCCTAAG GTGGGAAAAA CTAAAGGTAA 
280651 GTACTGTACA TATGTCTGGG ACACAACTTT GACCTTGGGG GAGAAACACA 
280701 CCCATTACTG AGTAGCTTTG TCCTCCACTT ATAAGGCATT GGATAGATGT 
280751 TCAAAATGGG GAGCTGTACC TATGAAACTT TJTGAAAGTCA TGGTAGGGGC 
280801 TATATTCCAG AAGGGCTGCA AACCCTAAGA GATGGGTCAT CTGCATTCTG 
280851 AGCCTTGACA GTGAACTGAA AGAGTGGTCA CCCTCTGAGG AACCTGGAGG 
280901 CCTTGCCATG CATGGCGCAG GGGCTTCTGT GCATCAGCTA TTTGTTCTTG 
280951 AAGCAATCTG GTCAGCTTGA AGAATCTGAA TATCATAGAG GGAGAAAGGC 
281001 TTCTATAGAG CATTAAAGCT TGTGACCTGA ATTCCCAATG TATGCATTTG 
281051 TTGGTTTATC CTGAAGAAGA GAACTTAAAT AAGATCATCC GCACGGCACA 
281101 GTTTACACAC AAGCATGTTA TTTTTCTGTT TTCTCTGTTG CACTCCCAGA 
281151 TTTTTAAAAA ACACTTACCA TGATATCCTT CAGGACTCAC CTTTCCTGGC 
281201 ACAGAGGTTT TTATAATAAA TAGTAATGGA ATGAATGAGT GAGGGTGCTA 
281251 TATAATATTT ACCGATATCC TACTTTTATA TATCAGGCCT CATATTATGG 
281301 TGCAGGGCAG CACAATTTAT ACCCAGTTGT TATTGCCATT TATATTCAAA 
281351 AGTATCAAGC TTTGTCATTT TGCAAAAATG CTTTTCAGTC TTGGAATTTT 
281401 TATTTATTTA GACTGGTTTT CTCTCTCTCT CTGCCCCTCT CTCTCTGTCT 
281451 CTGTTCTTCT CTTTTCTCCC AGGTCTAATG TATCAGAATC ACCTGGGATG 
281501 ATTGTTAAAA ATGCAGAATT CAGGGATCCA GTGCCCAGTC ATAATCTCTA 
281551 GGGCTGGGAT CTGAATATTA ACATTTTTTA AAGACTGCAG CTCATTCTCA 
281601 TGGACACTAA ATTTTGAGAA CCATTTGTAG TTAATCTATT TAAAGAATCA 
281651 CAGTTTTATA CATCGTCCGT CTGAAAATGC AAGGTATCAC CTAGAAACTT 
281701 TTAAGACAAC TTTTCCTCTC CTAGGAACTG TAAAATAACA AAGTAGACAA 
281751 GTGGAGAGTA AGGGTGATAG AGGTGGGTGG ATCAATAGGA ACTCTCTTTC 
281801 CTGTGAGGGT AACTTGATAG TGACTAGGCG GTGCAGTTGG AGTGAATTGA 
281851 TTTTGTTCAA ACAAGTTTTC CTGTTTCTCA GAGTTCACAG TGAACATTTT 
281901 TTTTGCAGGC TTCTTGCAGT GTGCTGTTTC ACAAACCATC AGAGGCATTT 
281951 AGAAGGGCCA GGGTTTTTCT TCTAAGGCTT AAGAACTTTA AATCTCTTGG 
282001 GTGGAAGTAG TTTTCAGTGA CTTGTTTGTT CTTGTGATAT AAGACATAGT 
282051 ACCACTAACC CGTTGAATCT ATCATTCCTA CTTATTGCTT TAAAGCCAGG 
282101 ATAGTATCAA GTTCTTGATG TACTTTACTA ATCATGGCTT TCATTTTCTA 
282151 ATGTGTATTT AATGGATATC TATTCATTTA TCATTCTTAA GTACAGAACA 
282201 CTAAAGAAAG CATACTGTAA TCAGAGATGT TATCAGGCTC AATTATTTTA 
282251 GATTAACTGG GTATTGTTAT GACAATACTC TGTGATATCC CTATCTTCAA 
282301 GTACAAAGTA GGCAAGGAAA AAAATGCACA TCAAATAAAA AGTAGTTGGC 
282351 AAAACTACCT TTTGATCCAT TTTCTCTCTA TTAGAAATAT GAATGGCTCT 
282401 CAAATGTTTT ACAAAGATTG TACATTGGAC ACCAATGAAG CTGTTCAACA 
282451 CTAAGGCCCA AAAAAGAAAC AAAAAAATAT GGTGGCTCTT TTTGAGAAAC 
282501 AATGAATTTC ATGTCTTTCT AGTAAAACAC ACTATAAAAT AGTTCTTCTT 
282551 GTGTTTACAT TTATATAATC ATTGAATCAT ATGCCACAGT CCTGGAAAGT 
282601 TCAAACAGGA AATAGTGCGT GTGTGTGAGT GGGGAGCTAC TCTGTGGGAC 
282651 TGGTGATCGC TTGGTGGATA TTATTTTTCT TTTTTAATCA AAGTTTAGCA 
282701 ATCATACAGA AATATGTAGG CAATTACCAA ACTCAGTGAA ATGTACACAG 
282751 ATACCCATGC CTGTATAGTC AGCACCCAGC TAAAGAAACA GAACATGAAA 
282801 AGCACCCCAG AAAACTATCC TTATGCTCCC TTTCAGGAAC TAATTCTTAA 
282851 AACCATTGTC TTGACTTCTG ACATCATAGA TTAGTTTGCC CTTAAGTTCT 
282901 TTATAAAAAT TGATTCTTTT TTTTTTTGAG TCAGAGTTTC ACTCTGTTGC 
282951 CCAGGCTGGA GTGCAGTGGC ATGATTTCGG CTCACTGCAC CCTCTGCCTC 
283001 CCCAGCTTCA AGCAATTCTA CTGCTTCAGC TTCCCAAGTA GCTGGGATTA 
283 051 TAGGTGCGCA CCACCACACT GGCTAATTTT TGTATTTTTA GTAGAGGCAT 
283101 GGTTTCACCA TGTTGGACAG GCAGGTCTTG AACTCCTGAC CTCAAATGAT 
283151 CCAAGGCCAA GGACTCCCAA AGTGCTGGGA TTACAGGTGT GAGCCACTGC 
283201 ACCCAGCCAA AAATGGATTC TTATATGCAC TCTTTTGTGA CTGGCTTCTT 
283251 TGCAACATGT TTGTGAGATC CATACGTACT TTTGGGTGTA CTTGTGGTTT 
283301 GTTTATTCCC ATTGCAGTAT ATTATTGAAT TTTATGAAAT ACTACATATA 
283351 TATTTATACA CACACATATA XACACATATA TGTATATATG AAATACTGCA 
283401 TATATATTTA TGCACACATA TACACATACA TACACATATA CATACGTATA 
283451 TACACATTTA TGTGTGTGTA TATATGTATA TATATATATG TATATTGGAG 
283501 GCCTTGGCCT TGGATCATTT GAGGTCAGGA GTTCAAGACC TGCCTGGCCA 
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283551 ACATGGTGAA ACCATGCCTC TACTAAAAAT ACAAAAATTA GCCAGTGTGG 
283601 TGGTGCGCAC CTGTAATCCC AGATACTCGG GAGGCTGAGG CAGTAGAGTA 
283651 TGTGTGTGTG TATATATATA TATACATATA TACACATACA TATATGTACA 
2837 01 TAAACATGTA TGGGTATATT TATACATACA TATGTATATA TGTATATGCA 
283751 CATATACATA CATATGTGTG TATATATGTA TGTACATAAA TATATGCATG 
283801 TATAAATATA TGTATATATG TACATATATG TGTATATATG TACATATGTG 
283851 TATATATGTA CATATATGTG TATATATGTA CATATGTGTA TATATGTACA 
283901 TATATGTGTG TATATGTACA TATGTGTATA TATGTACATA TATGTGTGTA 
283951 TATGTATATA TGTGTATATA TGTACATATA TGTGTATATA TGTATGTATT 
284001 TCTATTGATG ATAGATCATT GAGTAACATC TAAATTTAGG CAGCTAAAAA 
284051 TACTSCTGCT ACTCAATTCA TTTCATACAG CTAATATTAC CCTGAAAACA 
284101 AAACCACACA GACAGTATAA AAAAGCTACG GATAAATACT TCTTATGACC 
284151 TAAATTTTCA ACAATATGTT AGCAAATTGA ATAAAAATGT ATAAGAAGAA 
284201 TTATACAGTG TGATCAAGTG AGATTTATTC CAGGTATGAA AAGCTGGCTT 
284251 AACATTCAAA AATCAATGCA ATTCCTCTTA TCAACAGGGT AGGGTAGGAA 
284301 AAAATAGAAG ATTATATCAG TTTACCTAGA AAAATAATTT AACAGAATCT 
284351 GATATTTATT TACTGCAAAC ATGCACCACA AGTTAAGAAT AGAGAGAAAT 
284401 TTCCTTAACT TGACAAAAAG CATCTACAAA AAACCTAAAA TTAACATCTT 
284451 AGATGATGAA AGACTCATTA TTTTCCTCCT AAGGTTTGTA ATAAAGCAAG 
284501 AATGTATTCT CACCATTCTT ATTGGATGCA GCATTGGAAG TTCTAGCCAT 
284551 TGCTACAAGG AAAAAATAAA TAAGGAAGGA AGGAAGGAAT GAAGGATGAA 
284601 AGGAGAAGGC ATACAAATTG TAAAGAAAAT TATCTCCATT TACAAATTTC 
284651 ATAATTACAT AGAAAATCTG GAGAGATCTA CAAAAAAACT TACATAACAA 
284701 GTGGATGAGT TCAAAAAATC ATAGGATGCA AGATCAGCAT GCAAAATTTT 
284751 ATTTTTTATA CCAACTGTGA ACATACAACA ATCAAAATTT GAAATACCAT 
284801 GCAAGCTAAA ATTTTTCCAA AGAAATTGCA ATCTATTTGT GTATACTTAA 
284851 TAAAGCATGT ACAGAATCTG TATGCTGAAA ATGACAAAAT GCTGATGAAA 
284901 GAAATCAAAT AAAATGTAAA TGATAGAAAT ATATTCTTAA TTTCTTAATA 
284951 AATTAAGCAT GTACAGAATC TGTATGCTGA AAATGACAAA ATGCTGATGA 
285001 AAGAAATCAA ATAAAATGTA AATGATAGAA ATATATTCCA TGTTCATGGA 
285051 TTGAGAAGCT CAAAATAGGG AAGATGTCAA TTATTCCCAA GTTTCTGTTT 
285101 TGTTTTGTTT TTACACATAG ACAAGCTTAT TCTCAAATTT ACATAGAAAG 
285151 GCATAAATCT TGAATTAGCT AAAACCATCT TGAAAAAGAA AAATGAAATA 
285201 TGAGGAATCA TTCTACCTGT TTTAAGGCTT ACTGTATAAC TACAAAAATA 
285251 AAATTATTGT GGTATTGATA GAGAGATACA TAGATCAATG AAAAAATGAA 
285301 AAAAAAAAAC AGAACTATTC AAATATGCCA AACTGATTGT TGATAAAGGT 
285351 GCAAATGTAG TTCAATGTAG AAAAGATAGC CTTTTCAACA AATGGTTTTG 
285401 CAGAGATATA GACCAATGGA TCAGAACAGA GCCCTCAGAA ATAATGCCGC 
285451 ATATCTACAA CTATCTGATC TTTGACACAC CTGACAAAAA CAAGCAATGG 
285501 GGAAAGGATT CCCTATTTAA TAAATGGTGC TGGGAAAACT GGCTAGCCAT 
285551 ATATAGAAAG CTGAAACTGA ACCCCTTCCT TACACCTTAT ACAAAAGTTA 
285601 ATTCAAGATG GTTAAAGACT TAAATGTTAG ACCCAAAACC ATCAAAACCC 
285651 TAGAAGAAAA CCTAGGCAAT ACCATTCAGG ACATAGGCAT GGGCAAGGAC 
285701 TTCATGTCTA AAACACCAAA AGCAATGGCA ACAAAAGCCA AAATTGACAA 
285751 ATGGGATCTA ATTAAACTAA AGAGCTTATG CACAGCAAAA GAAACCACCA 
285801 TCAGAGTGAA CAGGCAACCT ACAGAATGGG AGAAAATTTT TGCAACCTAC 
285851 TCATCTGACA AAGGGCTAAT ATCCAGAATC TACAATGAAC TCAAACAAAT 
285901 TTACAAGAAA AAAACAACTC CATCAAAAAG TGGGCGAAGG ATATGAACAG 
285951 ACATTTCTCA AGACATTTAT GCAGCCAAAA CACAAATGAA AGAATGCTCA 
286001 TCGTCACTGG CCATCAGAGA AATGCAAATC AAAACCACAA TGAGATACCA 
286051 TCTCACACCA GTTAGAATGG CGATCATTAA AAAGTCAGGA AACAACAGGT 
286101 GCTGGAGAGG ATGTGGAGAA ATAGGAACAC TTTTACACTG TTGGTGGGAC 
286151 TGTAAACTAG TTCAACCCTT GTGGAAGTCA GTGTGGTGAT TCCTCAGGGA 
286201 'TCTAGAACTA GAAATACCAT TTGACCCAGC CATCCCGTTA CTGGGTATAT 
286251 ACCCAAAGGA TTATAAATCA TGCTACTATA AAGACACATG CACACGTATG 
286301 TTTATTGTGG TAGTATTCAC AATAGCAAAG ACTTGGAACC GACCCAAATG 
286351 TCCAACAATG ATAGACTGGA TTAAGAAAAT GTGACACACA TACACCATGG 
286401 AATACTATGC AGCCATAAAA AAAGGATGAG TTCATGTCCT TTGTAGGAAC 
2 86451 ATGGATGAAG CTGGAAACCA TCATTCTCAG CAAACTATCG CAAGGACAAA 
286501 AAACCAAACA CCACATGTTC TCACTCATAG GTGGGAATTG AGCAAAGAG A 
286551 ACACATGGAC ACAGGAAGGG GAACATCACA CACCGGGGAC TGTTGTGGGG 
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286601 TGGGGGGAGG GGGGAGGGAT AGCATTGGGA GATATACCTA ATGTTAAATG 
286651 ACGAGTTAAT GGGTGCAGCA CACCAACATG GCACATGTAT ACATACGTAA 
286701 CAAACCTTCA CGTTGTGCAC ATGTACCCTA GAACTTAAAG TATAATAATA 
286751 ATAATAAAAA GTACACCCCC CAAAAAACAA AAAAAACAAA AAGTTTTGCA 
286801 GCAAGTGGAC ATCTATAGAC AAAAAAGGAG CCTCTACTTA AACTTCTTAC 
286851 TGTTTATAAA AATCAACTTA AAATGAATAA TGAACTTACA TATAAAACAT 
286901 ATAAAGCAGT TAGGGGAAAA AAAAACAGGA GAAAATCTTC AAGATTTACC 
286951 ATGAGACAAA TGGTTTTTAG ATTTGACACC GAAAGCATGA TCCAAAAAAG 
287001 GAACAAATGA GAAATGTACT ACATCAAACT AAAAAAAAAG TAGTAGAAGT 
287051 ATTTGCAAAC TACGTATTCA ATGAAGGACT AGTGTCTAGT ATATACATAG 
287101 AGCTCTCTAA CCTCAAAGGA AAAAGAAAAC TTCAAATGAA CAGAAGACAT 
287151 AAAGAAATAT TTCACCCAGG ACTATATACA GATAACAAAT AAATAAATGA 
287201 AAAAATCTTT AACATCATTA GCTATTAGTT AAATGCAAAT TAAAACCACA 
287251 ATAAAATATC ATGATACCCC TATCACAAAG CCTAAAATAA AAATAGAGAC 
287301 AACACCAAAT GCTAACAAGA ATGTTGAGAA ATGGAATCAT TCATAGATTG 
287351 ATAATCAAAA TGTTAAGAGG TACAGCCACC ATATAACTAG CAATTGTACT 
287401 CTTTGGCATT TATCCCAGAG AAATCAAAAC TCATGTTCAC ACAAAAACCT 
287451 GTACACAATA GCTTTACTTA AATAATTGTG GTATAAACTT ATCATGGAAT 
287501 ACTATTCAAC AATGGAAAGA GATGGACAAG CAACAACTTA GATGAGTCTT 
287551 CAGAGAATTA TTCTGAGTGA AATAAGCCAA CCCAAAAGGT TACTGTATTA 
287601 GTCCGTTTTC ACACTGCTAT AAAGATACTA CCTGAGACTG GGTAATTTAT 
287651 TTAAAAGAAG GAGGTCCAAT TGATTCTCAG TTCTGTATAG CTGAGAGACC 
287701 CCTGGAAACT TATAATCAGG CAGAAGGCAA AGGGGAAGCA AGCATGTTCT 
287751 TCACAAGGAG AGAGAGAGAG AGAGAAGAAG GAGAAGAGAG GAGAAGGGAG 
287801 AAGGAGGAGA AGGAAGAGAA AGAGAAGAAG GAGAAGAGAG GAGGAGAGGA 
287851 GAGAGAGAGA GAGGAGAGAA AGAAAGAGAG AGAGAGAGAA AGAGAAAGAC 
287901 AACCTAGGGG AAACTGCCAC TTTTAAACCA CCAGATCTCA TGAGAACTCT 
287951 AGCATGAGAA CAGCATGGCA TGATCCAGTC ACCTCCCACC AGGTCCCTTC 
288001 CTCAATATGT GGAGATTACA ATTTGGGATG ATATTTGGGT GAGGACACAG 
288051 AACCAAACCA TATTAGTTAC AGACTGTATA ATTCCATTAA TATAACATTT 
288101 AAAAAAT-GCC AATGTTTTAG AAATGGAGAA CACATTAGAG CTTGCTAGGG 
288151 GTTAGCTGTG GGGGACAGAG AAAGTGAATG TGGTGATAGA AGCCTTTTAT 
288201 TATATGGAAT TGATTGAATC TATTTTTGCC TATTTATTAA AACATCATAA 
288251 AGATAGAATG TTTTGAGCTA TATGCCTTGT AAATTCCATC TTAAGTGGAA 
288301 ATAACAATTA TGAGAATCAA ACGTATCAGT TATCTATTGC TAAGTAACCA 
288351 CCTTAAATCT TGGTTGCTTA AAACAGACAT CGTTTATTAC TTTCCGTAAG 
288401 CAAAAGTCAG, TCGGGTGGTT CTGATCTGGA CCAAAGTCAG CAGATTTTGG 
288451 CTGGGTTCAT TTACATCTGC AGTCATATTT AAACAGAGGT AGACTGAAGA 
288501 TAAAAGTATG TCAAGAAAAG AACACAAAAC AGCCAACAGT AATTTTAGTC 
288551 AGATTTTCTT TTATGGAAAA GTCTTGTTTA TTGCTATTTC TTTTCTTGTT 
288601 GACAAGCTTC AGATAGAGAC TCTTCAGCTT TATTTTGTGA GAGATTGCTT 
288651 ATTAGCCAAT GCGTATCTTG TACTTTATAT GATTACTGAT AAATGTCTTG 
288701 TTGAGTTGTA TTTGAAGTCC TGGAAAAATA AACCTCTTGC TCAGTCACCT 
288751 GGATCCACAA TAACTTCAGT CACCAAGTTC ACTCCTTGAG TAAGCAAAGT 
288801 CTCCACAACA TCTGCATATT TGTGGCCACA CAAGATTGAT TATTTTAATT 
288851 TAGTCTGACA ACTTAGCAAG TGTAAACAAG TTAATGAAAC AAATTATTGT 
288901 GCCTATGGGT CAGTTAAACA ATGTTTGGAC TCGTTTTTCC ATTTTCTCAA 
288951 GTAGCTGTTT AGTAAACGTG ACCCAGGAGT TGAATAAATG AGGTAAATGA 
289001 TATCTTAACA GAATTGGTTG AATATTTTGT TGTCACCAGA GCCAGTCAAA 
289051 ACCCCAAAAT AATTAACTGG AAATTAGTAA GACCCTCTGA CAGGGCCAGT 
289101 ATCAGTTTAC TTTTTTTTTT CTCAATATTC TTCAGAATAC TGAACTTGGC 
289151 AACGGGGTGG AAGATCAAGG AGAGGTGAGT TTGTTTCTGT GAAAAAGAAG 
289201 AAAACACAGT GTACATCTGT TCCACTAGGT TTTCTTCCTC TACCATGTAG 
289251 ACAGTTTGAA AATGTTCAAT GGCCATAGCA TTCAGCTTCC CTGTTAAAAC 
2893 01 AAGTACCATA AAGTATTCAT GAATAACAAA ATAATCGTAT TCTTTTTTTT 
289351 TTAATGTAAG AGAACAAGAT GCTAACCAGG AACCACATCG ATCCTGAAAG 
289401 CGCTTTTTGG ACAAATCACA AGCCTGTGAT TAACTACAAA GAGCTTATAG 
289451 AAAGCCAGAA CTTTCAAAAT AAAAATAAAA ATGTTGACAA AGATTATCAA 
289501 AGAATACTGT GCAATATATA TATTTTTGTC ATTAAAAAAG GAGTGCTCAC 
289551 GCTAGCTGAT CCTTGAATTT CATTAGAAGC CAGGATGGCT GCCTGTGGAA 
289601 TGCAGTCCTT TTTCTGGGGC TTACAAACTG GAGGACCTCG CTCAGGCCTT 
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289651 GTTAACCAGC ACATGCTGTA TATTAAATAA GATTCACATT TTTAAAAATC 
289701 TTGATTTCTA CTTCTTCTGA AAAATCTGAA TATCTGCTTC TGTGCAGTCA 
289751 ACAGGCCTGC ATTTGTGTAT CGTAGAATCA TCTGGAGCTG AGCGGTGGCA 
289801 CTGTCATTTT GTCATACTCC CCATCAGTCC CTTTGAACAC TGGGGACAAC 
289851 TTTACTCATT TACATGAGCT TCCAAGTCAC TGTAGACTTT TGAGTCCGAG 
289901 CCCTTTGTGT CACTCTGAGA CTGGACCTTA TTTCTTTTGA CCAAGGCAGA 
289951 TACCACTGAG GGTAATAGTG TTTGTGGAAC TCAAGGGAGT GACTCCAACT 
290001 TTGGAGCCGC TGAACTCCAA TAATGGCATG CAGAGGCTGT GAGCCTGTTC 
290051 AATACCTAAG CCAGGTTCAT GTCCTGGATC GAACTTGAGG TATCTCCGAC 
290101 AACTCTGAAT CTCCTGGGAT CCACTCTAGA CTTGCGAGAC TTGCAATTAG 
290151 ATTTAAGAGC ACGTTGCCTT TAGAATCATG GAGCTCAAAG AATCAGAATG 
290201 TAAGTGGAAA CTGAGGTTGA GAGAGCTTGA GGACTTTCCT GACCTTCTTG 
290251 ACATTACCGA GTTCTGAATG GGGAATTCAT TTAATTAGGT GAGAAGTTTT 
290301 GTACCTCATT AATCAGTTCT TACTATTATT TTAATTTTTG AGGTCTTAAT 
290351 AACAGCAAGT AAAAACATAC TTTGTTTTAT CTGCCACGAG TTAGCCCACA 
290401 GTAACCCAGT ACTGTGCTAC CCAATGTGTT AACTATGTTG TTTTGTGGTT 
290451 TTTTAATGTA ATGTTTTGTA TTCATTCACT GTCCTATGAT TGTATGTATG 
290501 TCAGGGCCAA GGTCTAGCAG CAGTGGCTTC ATCTAGGAGT TTGCTAGAAA 
290551 TACAGCCTCT CTCATCCCAC ACCAGAACTC TTCAACAGGA ATCTCGAACA 
290601 ATATCCCTGG GTGATTCATA TAGTGTTCAT TAGACTTTGA AAGACATTGA 
290651 CATATATACA ATCATATAAT ACTTAAATGA ATATATAGCA TAATATAAAA 
290701 GGACTACAAA ATAACATAGT TAACAATTCA AATGTCTTCA GGTATAGAGG 
290751 TTTTTGCAGC TACTTCCATG CTTCAGCACA GAATGGTATG GTATTTTTCT 
290801 GTTTATTTCA TTGGGCTGCT TCTAAGATTT TTTTTTTTTT TGAAGAACGG 
290851 GTTTCAAAGG TTAAAAAATA CAAAACAAGT TAAAAACAAC CTGTTATTTT 
290901 ACTAGCACAA ACTGAGGAGA GAAACTAGAG GAATAAGGTT GAATTAGCTA 
290951 GACAAAGCTC TGTGGAGTGG GTTGGTAAAT TTGAATGGCC TTTGTTGATA 
291001 TTGAGAGGCA CAAGCTGGCC ATGAGGTCTA ATTGGAATTA TAACTTGAAT 
291051 CCTAAAGAGC TTGCATTGTT GTCGTTGTCA AAAGACAAAA TTACAACAAA 
291101 CTTAGTTATA GTTCTTTCTT ACTCACAGTT CATGAATCAG GACAGCTGCC 
291151 GTTCTACAAA ATAGAATGAG AGCTCCCACT GGGCACTGGC AGAGCGGTGG 
291201 GTTTTGTAAG GTGGGGACAA AGAAATGAAG CAACAGTAAA AAGCTGACTG 
291251 GTTAATGTTA AGTTAATTTT TAAATGAAGG TTAAATCAGA GAGGATCCAC 
291301 TTATTACACT GACTCAGGGA GCCTGAAATC TCCTATTTTC AGGAAAAACT 
291351 GTTTTTTTTT TTTTTTGCGG GGAGGGGATC TATCTGCTTC CTTAATGTTT 
291401 CAGTTTGATT ATGTGTCATT CAGCATGAGT CACTCCATTT TGGTTTGCAC 
291451 TAGTGTTGTG CACTAGTGTT GCACTAGTGT TGTGGCCTAA TGTAGGAACT 
291501 CAGTCCAAAG CAATAGCCTC TCATGATCTT TGATTAACAG GTCTTTATTG 
291551 TGTCTCTTCT GGCGTGTCTG GGCTCCTCTC TTCTTCAGCT ATCTCCTCCC 
291601 CTATCTCCTC CTCCTTTTCT TTTCCCTTGC CTTATTCTTC TGTGTTGTCC 
291651 TGCCTTGCAT TACTAAACTC TCAATCCTAC GTGCCTGACC AAAGAGCTGA 
291701 TCATGTTAAC CCACAATTAT CTGAACTTCC ACCAAGGGAA AGCATATATA 
291751 CCCTAGGATT AGAGATGGGA AACGTGCCTG AGATGAAAAT ATTTAAGTAA 
291801 GGAAGAAGGG TTGGGGCTAG GGTTCTCCTA CTTTGTTCCG CACTACCCAC 
291851 ACACATGTAG TTAAAGTTAA AAAGGTTGCT TGTACTCTCT GATACAATTT 
291901 GGGAGAACAT TGAACTTACT GGAAGATTCG TTAGTGCCAC ATTCAGTCAC ■ 
291951 CAGGTTAGGC ATAGAAGGAA , CAGGGCCTTA ATATTAATAC ATTGATTATT 
292001 TCCTATGTTA AGAAATTCTG GAAATTCAAC GCTGTAGTAC TGCCATCCTG 
292051 TGTAGATCAG ACTGACTAGG CCGGGATTTC CAAAAGTATA CTTTTATGAG 
292101 TGCGTGTGAT GTTGATACAG CATGGTATTG TACACTGCAT TTCAATTTCT 
292151 TTTCATATCA TTTGGGGACA TCCATATGGG TCTTGTTGAA AAAAAAATTA 
292201 CATTTCCTTG TGGCCAAATT ACAAGAAAAA TAAAATACAA AGTTTTAGGT 
292251 AAGATATTAA ATAGATATTA AATATTGAGT TTATATAAAA CATCCAAATA 
292301 ATTTTTTAAT AATATTTTAA TAAGCTTCAT TTTATGAGAA ^TACTGTTTTG 
292351 TATAAGGTTT TCTTGCTTGA ACAGCTACAA GTAATTTTTG AAGAAGGCAA 
292401 TTTAATGGTG ATCCCTTTTA ATTCTTTATT ATCACAAATA GCATTGGATA 
292451 TAAGGAGTGT GTATATTACT GTTTTCTTCA TTTGACTGCT TCTTTCTTGC 
292501 TAGCAACCAG AATTTAGTTT CTATAAATTA TAGGTCAAAG GACTTGTAGT 
292551 TCTGTAACTC TCATGATTTT GATTTGTTTT TTTTCTTTTC AGTGAAACTA 
292601 TAATGTTGAA AATATTTGTG ATAATGCATG CATATGGATT TATCAGTTTT 
292 651 TTATATTGTT TTCCAAAAAT AATGATGATG CTTTTCTTCA ATTATAGTCA 



WO 02/086113 PCT/GB02/01887 

97/21,9 

292701 GATATTGTTC TTCTAGTGTT AAAAATTTTC TTTCAGAATT CAAATAAAAT 
292751 GCTGGTTTAA CATAACCAAT CAACCATTTC AATCTCCCTT TTTAAAGTTG 
292801 AGTTTTTCTG AATCCATAAA TTTGGTCATT AAATATTAAC ATAATTTTAC 
292851 ATGTTGCAAT TTACATTTTG CATAATTCAC ATTCTTGCAA ATTTCAGCTA 
292901 AGTTCATTCC AGTGCTAAAA AAATGAGAAG TCGATTTTTG AAAGCAAAGT 
292951 TGAAAAGTAC TTTGGATCTA GTGATAGTGT TGTGCTTTCA TAACTAGCCT 
293001 CTTGAACTCC AGCTAATACT GTTGTAAATA CCTTATAGAA AATTTTTGTT 
293051 CCATATCTTG CCCTTGCACA CATTTAAAGT TCAAATATTT CCTGTACGGA 
293101 TAGTTGCTTT TGACAAGGTA CCAGTGTAGT ATGTGACAAT TTGTTTAATA 
293151 CCTTTGCATG CAAATAAATA ATAAGTTTGG AAGAAAAAGG AAGTATCAGG 
293201 GCCCACAAAT AGACATTTGA GAGAATCCTC ATGTTGCTCT TTAGTCCTGT 
293251 CAGCTTCTGT GGTGGATTCT TCATTTGTGT TATAAATCTG GTGACTTCAA 
293301 TTTTAATACA CTCTTCCTAA TTGTGGTGGG AAAATAAATG TCAAGTAAAA 
293351 ATTAAATCTG TTTAAGCATA AAGTTCTCAA CAGAAATGAC AATCTTTACA 
293401 AATTTAGCAA ATTAATTTTC TGATTATTAG ACTATTAAGT TTCAGTTAAT 
293451 TGAACAATTA AAAATGTTTA ACCTTTATAA AATGTGATTT TAAATTTTAA 
293501 AGGCCTACAG TGTCTTTAAT AAATCAATTT AAATGAATTA GGCTCAATTA 
293551 TTTTTATGAG AAATAAAAAT TTGCTGAAAC AAACAATAAT CAGTATAAAA 
293601 TTCCCTCAAA CCCACTCTGG CTAATTTTAA ATATCAGAAG CTGATAAAAA 
293651 CATAAGTAAA TCCAGAGGTT TTAGTCTAGA AAATTAACAA CACCCGTTAT 
293701 CTTCTAAATA TCAATATCTA AAAAGAAAAA TGAGAAACAG CAACAGCAAT 
293751 CGGAAGTGAT TTCTAGATGA GTAATCACAA AGATAAGCAC GTTAAAGGGA 
293801 AAGAACAGGA AATAACTTAC ATGCACTAAC AAACAGAAAA CGTTTATTGA 
293851 AAAGTAAATC ACAGAGCAGT GGAGGTGATT GAACAGGTAG ATGCTGTGTT 
293901 TTATTCCAAT AAAGTCTTCA TCTGCTGAAG TAATAATTGT CAAATTCAGA 
293951 AAGTTACATA CAAATAGTGT TCGAATGTAA TTTCAACTAT TTAAATTTTG 
294001 GAATAATTGC AAAAATGGCT TTAAATTTTA TCACCTCACA CATTTGACAG 
294051 CTATCAATAT GTCAGGGCCC ACCTGTGCCT CATTTGTTGC CCCTGGCTGA 
294101 GTTACTCTGG CCTAGTGGGG AAGCAACGGG ACAGGGGTGT GGGACTTCCT 
294151 GGGTTTGAAG CTAGGCTCCA TGGCTTATTT GTTAGGTGAT CTCAAGTAAG 
2942 01 TTACTTTGTG CCTCAGTTTC TTTATCTGTA ACGTGATGAT AATAGCCTAC 
294251 TACACTTTTG GCTGGCATGA GGATTAAGTG AGGACACATA TTTCATCCCT 
294301 GAACAATGTC TAATGTAGAG TATGTTCTCA ACTAGGGTGG CTCATTTTGA 
294351 TCCCTACTAC CACTAGTAAT AGTAATCCAT GAAAATAGGG CCTGTGGTCA 
294401 AATGATAAAT GTTGAATTAG GGCAAAAGTA AGCAGCTGTT CTAATTACAG 
294451 AATTATAGTA CTTAATATCC TTCTAATATG CATTGTGAAT CTCTGAGCGG 
294501 TGGCAGAGCC ATAGTGTATA GTGTTTTCTA AATTTACATA GCCGTACGAT 
294551 AATTATTTTG GCCCAAGGAT CTCATGTGGA CACTGCTGTG CAGAAGACAT 
294601 TTTGGAAAAC ACCAGCCTCG TGGCATCCCC AGACTAAATG AGCCTCTGCA 
2 9 4 6 5 1_ ACATCCTTTT AATAAGAAGG TCCTTTCCAA AGTCCAGTCC CAGCCCTGGC 
294701 CCAGATCCTC CTATTTGGAA GCATAGGCTT ATGCTCTGTG TCAAGGCTGA 
294751 TTTATCAGAT GGTAAAGCTT GATAGAACTA TAATAAACAC TCTTCTTGAT 
294801 CATGCTAAGT AATTTTAAAT CATCTAACAC TTTAAATCAG CCTCAGATTT 
294.851 TTCTTGCTTC TTTGAAGACC TCTGCCAGAG AGAACATTTG GAGCACTGGC 
294901 TCCCTCAGAC ACCCTTTGGG TTGCATGTGA ATATTTCCAT CGTATTTGAA 
294951 TCTGTTTTCA AAATAAAGCC TGATTAGAAA CTACTTTAAA TGTCTGACCA 
295001 GAGTCCAAAC ACCCTCAGAA GTCTTTGTGA AATGTCATCC ATGCCCGGAT 
295051 CCCTTATGTA TTTATAATAT TCCTGCTGTA TCTTTTTCCC CTGCACTGTT 
295101 ATGTTTTAGA TATTCATCCC CTTCTTCTCT CCAGGAAGGA GTGTGCAACT 
295151 AATATTAAAC ACAGAGACAG CCTTCCTGAG CAGCTGCCAA CAGATGCACA 
295201 GCCCTTGGCG CAAGTTCAGC TGTCAAACAG CCCTGGCCCA TGAAACAAAG 
295251 GGCAGAAGCT TCCCAACCAC ATGCGGATGA GCAGAAACAG CTGCATTTGC 
295301 TGTAGCCAGG GGCCCAGCAT CCAATTCCTG TGATTCTTAA TAAAAAAATA 
295351 CCAAGCACAT GGAAAGGGAA AATTAAATAC TATTCTGACA AATACCATAA 
295401 TTGCTAACAT CATCTCCCAG GAAGGCAGCA TGGCAGATAA CCCCTATTCA 
295451 TGAGCCTTTG TCCCATTGCG ACAATCAGTG CTCAACACAC TGCATTTTAG 
295501 CTTTTTACTC AGTCAATGGC CTGGCAAGAG AATTGTCTTT TTTTCCCTGG 
295551 AGCTTGAAAA TGGAGTATAT GAATTTGGGA GCATATGAAG TTGGTAGGAC 
295601 TAGCTCCTGG AGGTAAAAGT ACCTCAGATT ATACATAACA AGTGATTATA 
295651 TTTCAATCAG CAACATCTAC CGTAAAATAC ATAAGCCTAG GGGCACTTAA 
295701 GAAAGCATAT AAACCTGGGT TTGAATATGG TGGTAGCACT CAGCTGTGTC 
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295751 CTTCAGCAAG TTTGTTAACC CATCAGAGAG TTAGTTCTCT CAACTGTGAA 
295801 CTGGGCTGAT GTGAAGATTG AAAGAAATAA TACCTGTAAA ACCCTTAACA 
295851 AATGTCTGAA TACAGTGAAA GCTCAATAGG TGTTATTTTT ATTATATATC 
295901 TATTTGCTGG TCCAATAATT TTTATAAGAC CAGTATGACT TCTCCACCAG 
295951 AGACTGACCA GGAAAGAATG CCTTCTGAAT GACCTCTCTT GCAGCAATAG 
296001 CACGGTAGGA CCTATGTTTT CCTCGAAGGG TTAATTTCTC ATTCTCTGGC 
296051 TATGTCTAGC ACTAACAGTT CCATTAGAGA ATCTCTTGCC CTCATGATCT 
296101 TGTTCTGCAT CTGGTAAGCT CCCCACATTT GTTCCCCTCT TCACATCTCT 
296151 TTTGCAAAAT GAGCTTTCTA TTGGTGGTTG GCTCTGCCCC ACACTTTCTT 
296201 GCCTTTATAA CTCTGTGTAG GCATTTTCCA TTCTACTTCC TACTTCCATC 
296251 AGCTATAGAA GGCCCAACAC CCATTTCAGC TCTACTCCAC TCTAGCACAG 
296301 TAATTGTAGC AGGGGTTTTC CTAATTTCCT TGAGGGAAGA AGGAGAGAGA 
296351 AAGCAGAGGC AGGTAAAGGG CCCTTGAGAG GGCTCCCTCA CGAGCTCCAC 
296401 ACCTGTAATC AATGCCAGAA GACTCAATGC ACATTCAGAG GCCACCGGTG 
296451 TAGACTGGAG GCAGCCGAGA AAGACTTTGT GGAGGAGGCA AGATTTGAAG 
296501 AATTGATAGA ACTTGGCTTA TGGAGAAGCA AAGGGGGAGA AGAGGGAGAA 
296551 GGTGAGGAGA AACAATTCCT GAGTATTAGA AAAAGGTCTG AATTTGGAGC 
296601 TTTAAGATCT AGGTTTGTGA TCTTGGGCAA GGTACCTAAA AACTCTGAAT 
296651 TTTAGTTAAT CCATGATGTA TATATGGGGG CGGGGGGTAA TGTGAACTCA 
296701 GAAAACTTGT GTGAGAAGGT GCTTAAAAAT TAGTTGAATT TGATTTTATA 
296751 GAAGAGACAC TAGTAGACGT ATTTAGTTGT AAGATTTTTA TTGTGATATT 
296801 AATGCTGCCA CGCTAATAGG GTTTAAAGAT GTAGTTGAAT CCTGTTATAA 
296851 AACAAGCATC ATGTCAGATC CATAATTAGC ATTTAAGTAT TCATTGAGTG 
296901 CCCATTATGC CCTGGCAAGC TCATACCTCA AAATTGTGTT CTTTTACTTA 
296951 CATATGATTT ATACTAGGGT GTTTTATGAC TTTCAAAGAA ATTTGCCTTA 
297001 TCAAGGAAAC CAATTATACC TGATGGACTC CTGCATACTC ATGTTTTTCA 
297051 ATTTAAAATT ATCTTTGATT CTACTGATGA TCAGAAAATT TGTGAAAATT 
lil^l TTAGAAAGTC CAAACTTTTA AAGTTTGGAA AATACAATGA AGAAATAAAA 
llllr^ AATCAGAATG AACTGTTTCC TACTCAAATT TTCATTCATA ACATAATCTT 
297201 TGGAATATTA TTAATGGATT TATATTATTC CAAAGTGTGG GTATCTTGTA 
297251 ATTTATGAAT ATCTCATCTG TTCCAAGCTT TTGCTATTGT AATTAAACTT 
297301 GCAATGGATA TCCTTTGTAT AAAAATCAGT CTATAATTAC TTCCTTTCTT 
297351 TGCATCTCCA TGAAGGGAGG AGAATGTTTC TTCCTCATTT ATAGGCTCAA 
297401 CAGCCCAACT TGACCCTCCA GTTTGTCACA CACATCTTTT CTGCAGGATA 
.297451 AGAAATATCA GTTTCCTTGC ACACTAGCAG AAGTGGTTCA TTTCCACTGG 
lllll 1 AAACTTACCA GAGGCATTCT CCACCAGATA CTGCATTTCC TTAGGAAAAG 
297551 TTTTACCTAT TTTGCTGAGA AGATTTTACT CACTGAAGCA TAAAAATACA 
297601 TGCTCTCAAG ATGAATACAT GTCAGGTAAG TTCTAGAAGG ATTAATTTTC 
297651 AGTTATTTAA TTCTGGAATT TAATTAGATC ACCTTGGAAG GCAAATTTAA 
297701 AGAACATTCT TAGACAAGGT ATTCATAATT CGAAAAGGAT TTTCCTAAGA 
297751 ACTGATAAGC ATGGTGGTGT CTTTCCCTTT AGTGCTTTTT CCTGGGGGCA 
297801 TTTAGAATTC ATGGAAAACA GTTGTCAAAC CTCAAAAAAT GGACTAATGA 
297851 ATGAATATAA GCTCAAAAAA TGGACTAATG AATGAATATA AGCTTTTACT 
297901 CAAGAATTAG GAAGGATTAA CATCCTTTTG CTCATTCTGT GACCAGATGT 
297951 CTTGTATAGG GTTATATATA TTTCATTTCT GTTTCTCTGG TTAAGGGCAT 
298001 CAGTTGAAAG TGTTTGTTAC AGCTGAATTG ACTTGTGCAT TCATACTCTA 
298051 AATATTATTT TAAAATATCT AATTAATATA TTATTCACCG AGCGACTCTT 
298101 TTTTAGTAGA AGCTATCACA TCAGAAACTC TTTATTCTGA CAAAGTGACA 
298151 TATTGTATAA TTTTATTCCC AAAGCTAATT TTTTTATTTT AAGACATTTC 
298201 ATGAGATGTT AAAAGCCCCT AGTGAAGACT GTTAAATGAG CCATTAAGGT 
298251 TAGGTTATCA ATGATCAACT AACCAAACTA TCTTAATTTT CTTTTGATTG 
298301 AATAATAGTA ACAGCAACGG TAGGAACACA CATATGTGCA ACTAAAAAAT 
298351 AAATAAAAGT AAGGACAAAC TAAAAAATAA GAATTATGCA ATGGGCATTA 
298401 TTTAACTTTG AAAATGTCCT ATAAGATTTC AAGGGAAAAA GTAAATAAAA 
298451 TGTATATGCA CGTGCTTACA TTTTCTGGAA ACTTCTGGGA GCAACTATTT 
298501 TGCATTTCAA TATATTTTTA TTTAATATAA AACTAAAAAG TTGTCTTAAA 
298551 TAGAAGTTAA AAGCAGTTTG CTTTCCATAC GCAATTCATT ATATCAAATT 
298601 GGATAGTGTT CTTGGCTCAG TCAGCAGCAG CAQCAGCAAC ATTGAGTCCC 
298651 CTTGGGTGTG GCTCTGTATT AGGGTACAGC TAGTAAAATT GTCATTTGAG 
298701 ATAGAGAAAT AGTGTCATCT CTTAAGAAAT ATGCTATTTT CATTTAACAA 
298751 ATGTTTGTTG GGCATCTATA AGCTGTCCAG ATGCCATGCT AAGCACTGGG 
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298801 GACACAGGTA TGAGCAGCAT ATACATGATC CTGGCTTTCC AGCCCTTAGG 
298851 TTCTAGTGGA CACTAAGACA AGCACACAGG TACTATGAGA AATAGTAGAA 
298901 AGCTCCTGTG CTCCAGGAAA TTAGAACTGA AAACAGTTTG TAGTGCCAAG 
298951 TAGACAGGTG GAAAATATGT GTTTAATAAG GAATTAGCAT AAAATATAGA 
299001 GATACACTCA GTGTGCATTC CTATACATTC AAATACAGGA ATGCACAGTG 
299051 GCAGCAAAGT GTGGTGGTGA GCAGATGAAC TCTGGGCACA AATTGCCCAG 
299101 TTTCAATCCC GGCTCCACTA TTCCTATAAG GTAAGTGGAG CAAGTTACCT 
299151 TTTGCTCTGG AAGCCAGGAC AGGGATGGTG GGCAGAGAGA GAAAGGGCCA 
299201 GAGAGCGGGA AGTGAGGACT GCTGGAGTAT AATTAAAGCA AAGCAGGTAT 
299251 AGTGGTTCCA AGAAGAAAAA TTTCTTTCAA GAAGAAAAAT CCAGAAGAAT 
299301 GGCTTGTGCA GCAGGTGGCG TCACACAGCA TGTAAGAGAG GGACTTACTT 
299351 TAATGCTGGT AGGAACCGGT AAGAGGACAT AGAAACCCAG AAAGATAAGG 
299401 GTACTTGCTC GAATCTCTCA ACCAACAAGT TCAGAAATCT CATGTAGGAC 
299451 GTGGTGAGGT TTATCAAAAT GGGGCATGGG ATGCAATGAA GTAGCAGAGA 
299501 CACTCAATAA TTTCGACTTG GAAGACTGGA GGATTCATTA CCATAAAAAA 
299551 ACTGAACATG GGAAATGAAT TATAATAAAA ATATAATGAC TGGGTGTTTA 
299601 GTTTGAATAA AGAATAGAAC ATCACTGTGT AATAATCGCT GAGAGGTTTA 
299651 CAGGTCAAAT GGAAGATTGT TCTGCAAGAG AAAAGAAAGG TAAACACTGC 
299701 AAGCAACTAC AGGCCATGAA CGATGGTAAA ATCGATGTGG GAGACAGAGC 
299751 ACTGAACTGG GAGTCAGGAT ACCTGGCCTC TACTGAATTT TTAGCAAATA 
299801 ACAGACTTTT CCTGCACCCT AGTTTCCTCA TCTGTAGGCA TTACATAAAC 
299851 AAATATTTTC ATTTATGTTT CTGTGAGCTC AAGAACATTG CCTGGTATTT 
299901 CATATTATTA AGTGCTTAAT TTTCTATTTT AGTGAAAGTA TGCTGCAAGA 
299951 ATTTAGAGAC AGTGGCAGGT AAATAAAGAC AGCAATAAGG CAAAAAAAAA 
3 00001 AAAAAAAAAA ACCCAGCAAA TTGTCAATAA AAACAGTAGT TAAAAGCTGA 
'300051 TTTCTGGAGG CAGACTGCCT GAGTTCAGAT CCTGGCTCTC TGCAGTGTGA 
300101 TCTTGGGCAA GTTACTCAAC TTCTTCTGCC TCATTTTCCT CATTTATAAA 
300151 ATGGGAATAA TAATATTATC TTCCTCATAG TATTTGTGGA AAGATTAAAC 
300201 AAGTTAATAT CTATAAAGCT GCTTAGAGCA GAGCCTTTTA CATAGGGAGT 
3 00251 GTGCAAAATT GCTATTAAGC TTCACTGAGG AGATGAAACT CACTTGAGCT 
300301 GGATATTAAA GAGAGAAAGG TTTGGAGAGA TATTTCAGGC CAGGGAAGCA 
3 00351 GCATGTGAAA AGGAAGAAGC AAGCATGAGT TTGACAGTTG TGCATGGAAG 
300401 ATAATCGTGT CACTTTCTAG GGGAGAAAGA AACCAGGATG AATAATGGAG 
300451 AGCATTTATT TCTAGTACAG AGGTTCCTGC TACAAGGATT AGGAAATCCG 
300501 GTCATACCTA CATTGTGCCT ATCTCTGTAG AGAGGACAAT GGGACAATAC 
300551 TGGAGGTTTT CAAAAGGGAG TGACATACCA TGGCTTTGGA GTAGCTGGGT 
300601 CTGATAGGCA CATTCTGATT GAGTATACAT TACTGAGGTC ATTTGGCCAG 
300651 GGTTTTCTGC AAGAGTAAGG CAAAAGATAA TGAATATTTT TAGAAAATAG 
300701 TAGTTTCAAC ATACTCCAAA AGGTTTGGTA TTAGTGATAG AATGGGCCAG 
300751 GGTTTAAGCC ATGAAAGAAA AGCCTTCTCT TTGGGCCTTT TCATCTAAAT 
3 00801 TACAGGTGGT ATTGGGGTCC CTGGAGAGAG TTCATTTTGT AGTGAAATGC 
300851 TATGTTTTGT GTGGAGAAGT TTTCTATCAG CGCTTTTGTG AAATTTGGAG 
300901 GTCTTTAAAA AAATCTGTCT AAGTAAATGG TAACTAGGTA ATTTGGGAGA 
3 00951 GGAACAAGTA TAGTAATCTC ACTGAAAAAG TTACAGTTTT ATTTGCTAAG 
3 01001 GATTATAGAA ATAGTAAAAA AAATGTGAAC GTTTTCATGG GCTATTACAT 
301051 GTATGTCTTT GTTATAAAAT TTTATTGCTT AAAAGATGTC CTCATCCACA 
3 01101 GCCATGTCAC CTGAGCCTGT GCAGTGCTAG TATTTGAGCA GATGCTGAGT 
3 01151 TTTCATTTTG CTTCAAAGTC CATTAATTCA GAACTCTTTA AGAGTGCCTT 
301201 TAAAAGTTCA ACCACATTTA ACTCATCTTA TCTACCTTTG CATTTCCCAT 
•301251 CCCAGTTGAC TAAAGTTTTT TAT ACT AT AA GTGATCACAT GTTAAGTAAG 
3 01301 ATCTGTAAAT ACTGCTTTAG CTCTGGGAAA TTGTTTGAAA GCACTTTAAA 
301351 ACATGGAGTC TTTCTCCATT ATTGTATTTC TTTCTGTATT TATTGCCTAT 
301401 CATCTTAAAA AAATCTGAAA ACTGAAGCTA TGACAAAATA ATGAATTTGA 
301451 ACATATCACA AGTTGATAAC TAAGATAAAG AATGTTCTTT TATAAGACAT 
301501 TGCAAAGGGA GGTTTTATCC TCTGTAAGTA TGCCAAAAGA ATCTACTTGG 
301551 TCTCAGGTGT TGCCATTGGA TGTCAACAGT GATAGGCCAT TTAAAGATAA 
301601 GGCCATAAAC TGATATCTAG AGAACTTTTG ATTAATTAGC AGCTGGAGGA 
301651 TAAATGCAAA GAAGCTTCTT GAGTGTCTAG ATAGACTAAA GTTTGCTGGA 
3 01701 TGAGGTAGAT AGAGACATCC TTCTGGTGGT TTCTCTATTG TCTGAGAAGC 
301751 AAATGCTAAC TGAATATATA GATAGACATT ACACACACAC AGAAATTTTT 
3 01801 ACCTTCTCTC TTTTCACAAC CTTGCTATTG CCCTTTGGCA ATTTCTTCTT 
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301851 CCTTTATATT GGACAAGACT GATCCTGATA TCATTTGTAT TAGTCAGTGT 
301901 CCAGTCAGGA AAATATAAAC CATATCAGAT ATTTTAACAG AGGAAAGTTA 
301951 TTAGAGGAAA TTGGCCAAAC AAGTACTGAA GGACTAGATT TGAGATTTTA 
302001 CAGAGCTAGT TATGAAGGAA GCAGCTACCA CTTCTAAAGT GGTGGTGGGG 
302051 TGGGAGTGGG GCAGGGAGAG AAGATGTTTG GGGTTCTTAG GATGAAGAAA 
302101 CCTGGAAAAG GTGCCTGTGA AACTGAAGTC CAGACCTTTG GATGGAGGGG 
302151 CTCTGTTTAG CTGGTAGCAC TATATCAGAA GCTCAGAGAA GGGACACTAC 
302201 AGAATTGGGG CCTACACCTC TAATGAGAGT GTTGCTTATA CTATTTTGGG 
302251 GATGCTGCAA CAAAATACCT TAGACTGGGT ATCTTGTAAA TAAGGAAATT 
3023 01 ATTTCTTACA GTTCTGAAGG CTGAGAAGTC CAAGATACAG GTGTCAGCAG 
302351 ATTCAGTGTC TGGTGAAGAT CTGACTCCTG GTTTGTAGAT GACTGTTTTC 
302401 TTGTGTCCTC ATAAGGTAGA AGTGCAAGAA GTCTCTCTCG GGCCTCTTTT 
302451 ATGAGGGCAC TAATCCCATT CATAAGGGCT CCATCCTCAT GATCTAATCA 
302501 CCTCCCAAAG GCCCCACCTC CTGATATCTT CACCTTGGAG GGTTTTGAAC 
302551 ATATGAAACT TAAACTCTGG ATTTTGGAGA GACACAAATA TTTAGACCAT 
302601 AGCAGGGTAC TTTGCCACCA CTTCAACTAT CTATGAGGGG TCACAATGAG 
3 02651 GCTGGTTTGG GAACACTCAA TGAGACGCAG GAAATGGAAC CAACTGCTAT 
302701 TGCCAGGATG AAAGGCCATT GTTGAGGTGA CACCTGCAGG AATGTGAGCA 
302751 AAACTGGAAG AAACAAGTCC TTTGTTTCTC CTCCAGCCTC TAATAGGTTA 
302801 ATTACAGCAC AAGAATTTTA ACTAAGCGTT ACTGTTATCA AAGCATGCTG 
302851 ATAGAAGGAC CTTGGAGACC CTCTCACCAT ACACAGACAT ATATATATAT 
302901 AATATGTATA TATGTGTGTA TATATACATA TATATGCATA CACATACATA 
302951 TATACATACA CATATACATA TACATGTACA TGTACATGTG TACATGTACG 
303001 TATACATGCA CACGTGTATA CATGTAGGTA TACACGTGTA TACATGTAGG 
303051 TGTACACGTG TATACATGTG TACGTGTATA CATGTGTATG CATGTGTACA 
3 03101 TATACGTGTG TATGCATGTA CGTGCGTATA CATGTGTATG CATGTACGTG 
303151 TGTATACATG TACATGTATA CGTGTATACA TGTACATGTA TACGTGTATA 
303201 CATGTACATG TATACGTGTA TACATGTACA TATACATACA CATGTACATA 
303251 TACATACACA TGTACATATA CATATACATA CACATACATA TACATATACA 
303301 CATACATATG CATCTATATA TATATGCATA TATATAGATG CATCTATTTT 
3 03351 ATAGATGCAT CTAAGAATAG GGTCAGTGGT GAACCATAAA CTAGAATGAT 
303401 GAGTATGATT CAGGAAAAAA AACCAAAAAC TAACATCACT CTTGTTCCCA 
303451 TTATGGGTCT ATAATTTTAT TAAGTTATTC AGAGATAGCA TCTCAGTTAA 
303501 TAATTCTCAT TTAATAATCC CAACAGCCTT GTTAGAGAAG TGGGATAATT 
303551 TCCATATCGA AAATGAAAAC AATCAGACTC AGAGGATTTA AGTAATTTCA 
303601 GTCAGTCAGT AAGTAGCGGA TAGGATTCTG GCCAAGGTCT GTTTAACAAC 
3 03 651 AAAGGACATG TTTGAACTCT CATCCCCTTC AATGGAAAAT AAGGCAAAGC 
303701 CAAGAAGCCA AAAGTACAGG CACCCAACAA AGTGGGAGGC ACAGAACAAG 
3 03751 GGGGAGGGCA GGTATGAACA TCCAGAGCGA CTTTGATTCT AGGGAGATGC 
303801 TCACTCGTCT ACGGAAGGTG TATTGTATTC AGTTTCTGTA CTTCCGGTCT 
303851 TCACTCTAGC CTTGCCTTCT GGATTGTTCT ACCAAGTACT TTGGCTTTTC 
303901 TCATAGTCTT TCAAAAATTG TTAATATTAT TTTGATTGAC CAGTCATAAT 
303951 TGTACACATT TAAGGGGTAC AATATGATAC TGTGATATAT GTATACAATA 
304001 TGAAATTATT AAATCAAGCT AATTAACATA TCCATCACTC CAACTACCTA 
304051 TCATTTTTTA TGGTAGGACA TTTGAAATTT ACTCTCATTT TGAAATGTGC 
304101 AACACATTGC TGACTATAGT CACCCTAATG GGCAATAGAT CTCAAAACTT 
304151 ATTCTTCTGC CTGTCCTATT TGAAACTTGG TACCCTTTGA ACAACAGCTC 
304201 CCTACTTCCT CCCTCCCCCT CCCCTGCCAG GCTTTGGTAA CCATCATTCT 
oa^a 1 ATTCTCTACT TCTGTAAGTT TAATGTTTTA AGATTCCACA TATAACTGAG 
304301 ATCTCAAACA AACTCAAATC AGTATGTCAA AGGAATATCT TCTGTCTTAG 
304351 TCTGTTTTCT GCTGCTACAA CAGAATACCA CAAACTACGT AATTTATAGA 
304401 GAAAAGAGAT GTATTTGGCT CACAGTTCTG CAGGCTGGGA AGTCCAAGGT 
3 04451 ATGATGCCCA TATCGGGTTG GCCTGGGGAG GTTGTTCTTG TTGCATTGTC 
304501 CTTTGGTGGA AAGCAACTGA GCATGTGAGA CAGGGAGGAA ATTGGGCCAA 
304551 ACTAATTATT TTATTAGGAG CCTATTTCTG TGATAACTAA TCCCATGATA 
304601 ATGACATTAG TCCATTCAGG AGGGTAGAGC CCTAGTGACC CAATCATCAC 
304651 CTAAAGGTCC TACCTCTCAA CATTGTTAAG ATTACAGTTA AATTTCAACA 
304701 TAAGTTTCAG AGGAGACACT GAAGACACAG CATCTGCATT CCCATGTCCA 
I 0 *! 5 * TG TAGCTATT CACAATAGTC AAGACATGGA ACCAACCTTA AGTGTCCATC 
MTOTGM TGGATAAAGA AAATGTGTTA TACATATGCA ATGGAATACT 
304851 ATTCCGCTTT AAAGAATAAG AAAATCTATT ATCTAGCACA ACATTTATAG 
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304901 ATCTGGAGGA CAACATGCTG AATGAGATAA GCCAGGAACA GAAAGACAAA 
304951 TCCCATAGTC CTGAAGTCAA ATTTTTGCTG CCACAAACTG ACGCGTCCAT 
305001 GGTGGCAGCT ATCCCTGATC TCTCATGCTA TTCATGGGTA TGATACAACA 
305051 GTGGCCAACT TCATTCTCAT GCTTTTCACT TCTTGCCTCC CACCTTGGGG 
305101 GTTTTCAATG ACTGCTTAGA TATGAGACAC CAGGGCACAT GCTCACTCCT 
305151 CATGCATGTG TAATCCAGAA AGGGCAGGGA GTTAACACCC TTTGGAGTGA 
305201 ACTTTGAGCA AGGGCTGTTA GAATGTGTAG ATGCATTTTT CTCCCTTCCT 
305251 CTCCCTGGAT GAACTCTTCA CATTTATAAT AACGTCAGTT GGCCTCTTTG 
305301 AAGATGTCCT GCAAGGCTAG GCAAACAGCT TCGCTAAGCT GGACAACTCT 
305351 ATAGTACACT CCATTATATT TGTTTCCCCT TACACTCCTT TTCCTTCACT 
305401 TCTGCTGTTT TGGGACTGCG CCATCTCCCA ATAAAATGTT AGCACAAGAT 
305451 GCTTTTGCAA AGGCTTTTTT TTTTTTCTGG GTTAAGAGCA GGGACTGGGC 
3 05501 TGTGAAGGAG AGCCATAGGC TAAGATGATA GGACCCACAA TTTTCTTAGT 
305551 GTCTGGTAGT AATCCCAGGT ATCCATCAGT TCAGGAAGGT AGGTAGAAAC 
305601 TATAAGTAGA TTAAAACAGG GTAGCAAGAA GGACTTCTGA AAAAGTAGTG 
305651 AGAATAGATC TGACAGCTTC CACCTTTATG GAATGAAAGC TTCATCCTGT 
305701 AGATTGGTTC TTGTGATGCT TATTTAAAAT AAATTTCCTT AAATGAGGCC 
305751 CAACATGAAC CAATTCTCCA CCCCTGTGAA AATTTTCTAT TGCCAGTTTG 
305801 AACTTCTATG GGTGATCATA CAATCGCATA GCTTCTGAGG AGCTAACTTT 
305851 CACTCAGAAC TTAGGATTCA TCTGTTGTCA CCTCCTCTGG GGAGAACTCA 
305901 CTGGCCCCTC TCCTTTCTGC CAGCTGTTGA GAATATTAGG GTTCCCTCTT 
305951 CTGTGTTTCA GAGGTATTTT CTCATTTCTA TCGTGGCACA TTTTATGCTG 
306001 TTTCTGCACA ATGCAGAGAG ATCCTGAAAG CAGAGAATAT GCCTTAGCCA 
306051 CTGCTGAATT TCTAGAAACT AACAGAAGCC TGCCACACAG TAGCCATTTA 
3 06101 ACATGGATTG ATTCCATTGA CTAAATTAAA GAATTCATGA GGAGCTCTGA 
306151 TCATGGCCTG AGGTTTGCAG TTACCCTATT GAACAAATCC CAGAAGAGAA 
306201 TTTGCGGAAT TAGGTCTACA TGTAATACAC ATTTACCCAC ATGATGAAAT 
306251 AAGATGACAA GAGCTTTGTG GTATCAAAAT ATCTATGTCT CTGACAATTT 
3 06301 TCACAACTTT ACTTAATTAT ATTAGGGTTA AGTATTGGAA AGCCTTGATA 
3 06351 CAATGCTAAT TTGGAAGCAT TATACCAAAG TGTTAGATCA GATTAGCCCC 
306401 AGTTCTCTTC TAAATAGTTC CAGAAACCAG ATTTGGACCC AGTCACGTAA 
306451 CGGCAGTAAA AACCAAGACC TGTCTATCAT GCTACACTGC TTTCTTCAGA 
3 06501 TGACAGGGGC CTTAAGAGCA AAGTAAGATA TTGATTGGCC TCCAGGATCC 
3 06551 AGGAAAAACA AAACAAAACA AAACAAAAAA GCCAGAAAGA ATACAAGGTG 
306601 AAGGCAAGAC TGAAAGGAAA ATAGTTTTGA TGTACTAGGA AACAAAATCA 
306651 TTGAATCTGA AACACTGTTG TCTTTGGAAT ATCCTGAGAG AAAGGAGCCC 
306701 AACACAGGTC ATTTGGCTGG AAGGAAGATC AAGGCAGGAG TCCGTCTGGT 
306751 GCGTGGGGTT CTCCATGGGA GACTGTCAAC AGCTAAGGGC AGCGGAGGGC 
306801 AATGATGAAT GCCTCAGAGC CGAGGGCCAC AGAAAAGGCA CTGGGAACAC 
306851 TTTCCATGCA CTTCATACCT ATGACTGAGA GGATGGCCTA GGCAACCTGC 
306901 TTTAGGGTGG AAACTATAGT TACAAAGATG AGCAGTCACT TAATTTACTA 
306951 ATGAATGACC CCAGGTCTCA TAGCTTTTCT TTAGGAAGGA TTATATAGTA 
307001 GTTAAAAAGT GCATGGTTCC CAGATATAGA AAAATCTGGG TATTCACCTT 
307051 GGCTGTACCA TTTACTGTGT GAACATCTCA GGCTACTATC TTTACCTCCA 
307101 TTTGTCTCAG CTGGCACCTC CTTAAAGCAA AAATAATAAT ATTACTTCAG 
307151 AGGGTTATTG TGGACATTAG AGAAATATAT GTGAAACACC TGTCCAAGTA 
307201 GCAATGGTGA GTGTGCAATG AATACTAGTT GTTTATCATG TGATTATTAG 
307251 TCTAATTCTT GCAAAATGTC CCAGCCTTTG TGTTTTGATG AACAAGAATA 
307301 TATTTTTAAA AAATTAAAAA ATTTTGTTGT CCTCATAATA TAGACATCTG 
307351 CTCCCTGGGA CCTAAAAAGC AATCCATAAT TTTCCAAAGC TGCTAAAAAA 
307401 TCTGCCATGT TCTAAATGAC CTTTATCAAG CGATCTCCTA GTCTAGAGAG 
307451 GCGGAGGAGT AACTTGAAGG TGAAATACCC CGTTAGCAAT CCCCCAGGTG 
307501 AATAGAAAAT TTAAAATAAT ATTCTAATGA TTCATTGTAC ATTACTACGC 
307551 CCCATTAGGT GTCTACATTA CATCTTGGGG CACAATCAAT ATCTGTCATC 
307601 CTGCTTATGG TTTTGGTGCT AGAGCGGTTC AATTTCATTT TGCAACTCTC 
307651 ATGCTCATGA AGCTGTATCT CGGGTGTCAT ATGATAAAAT ATTACTCAGA 
307701 AGTACGTATT CAGTCCTAAA GCATAGTGGG AAATAATTGA TGAAGAAAGT 
3 07751 GTCTTTCCAT CTGGTCAAAC ACCATTAAAG TGAGGATACG ATTTATATGA 
307801 GAAAAAACGA AGATTGATAT TCCTTTTTTT TTTTTTTTTT TTTGAGACAG 
307851 AGTCTCAATC CATCCCCTAG GCTGGAGTGT AGTGGTGCAA TCTTGGCTCA 
307901 CTGCAACCTC TGCCTCCTGG GTTCAAGTGG TTCTCCTGCT TTAGCCTCCC 
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307951 GAGTAGCTGG GATTACAGGC 
308001 TTTATTTTAT TTATTTTATT 
308051 GCCAGGCTGG TCTTGAACTC 
308101 CCAAAGTGCT GGGATAACAG 
308151 GATATTTTAT GAAACTTTTT 
308201 TTCTGTACTC AACATAGTTC 
308251 GG ATTGCTAC TTACAAAATT 
308301 ATCCCCTTTA TAAATACAGG 
308351 TTAAAATTTA AAAAATCATA 
308401 GATTCCAAAA C^GTCATGGA 
3 08451 ACAAAATTAA TGAATGAGTT 
308501 CAGAATTGAA ATGCCAAAGG 
308551 ACAAGCATAC TCCAATCGAA 
308601 AGTACCTAGT AAGTACTAAC 
308651 AGAATTCAGA CATCCTTGGG 
308701 CTTTCACATG AATTTTTGAA 
308751 GAGATCAGAC TAAGTGTGAA 
3 08801 TGTGGAATAG GACATGCCAA 
3 08851 GATGACATAA AAGTGCGTGT 
308901 GTGAAGTATA AAGGGCCCAT 
308951 TGAAAAATGG ATTTAACTTG 
309001 TGCTGCACCC TTTGCAATGG 
309051 CATGCACAGA TGCTCTAAGT 
309101 AGGGATGTGC ACAGCTAGTA 
3 09151 TACTAAGTGG GACAGTGGTT 
309201 AAATGACTGG AAAAGTGGCT 
309251 TGGATATACT TTATAACCAG 
309301 TAATTTATGT CATTTCTTTT 
309351 CTGATATATC AATGACACGT 
309401 TGAAGTGAAA GTCAGTCAAT 
309451 CCAAATTTGA GAAAAGAGGT 
309501 TCCCAAACAA TATGTTGTTT 
309551 GGGGGTTGGA TGCAGGATTA 
3 09601 AACTCTGGTT GGGACAATCT 
309651 TGAGATTGAC TGGGCTCTCC 
30S701 TGACTTTTGT CCCATAAAAA 
3 09751 ATAATTATTC ATCAGTTTCT 
309801 GATCCCTTGG TATATTTTAG 
309851 AGTAAAGAGC TTACCTTGAT 
309901 AGTAAAGTGA ATATTCTACT 
3 09951 ACTACCTCCC TACTGTCACA 
310001 GACTGGTGTG GGGTGATTTT 
310051 AGTCCTGAAT TTTAGAATTT 
310101 TTCTATTTAG TCTTCTATTT 
310151 GTTACTATGA GACAAATGTT 
310201 CAATTGCTTC TTGAAGTGCC 
310251 TTTTCTCTTT CTGGATTTGA 
310301 TCATGTTAAC TTCCTGCCCC 
310351 ACTAAGCCCT TCTCCTCACC 
310401 CTATTGCTTT TCCCAAGAGT 
310451 ATACCTTACT GAGCTTTAAG 
310501 AAAGTAGCTT GCATTCATTG 
310551 AACACATGTC ATGTATATGT 
310601 AGATTCTTCC TCTACCCTCT 
310651 TTAAACTGAG GTGGCTGGGC 
310701 TTGGGAGGCT GAGGCAGGCA 
310751 TGGCCAATAT GGCAAAACCC 
310801 GGCATGGTGG CGGGTGCCTA 
310851 GAGAATCGCT TGAACCGGGG 
310901 CCACTGCACT CCAGCCTGGG 
310951 AAAAAAAAAA AAAAAAAAAG 



ACGCACCACC ACGCTCAGCT AATTTTTGTA 
TTTTTAGTAG AGACGGGGTT TATCATATTG 
CTGACCTTAT GATCTGCCTG CCTTGGCCTC 
GCGTGAGCCA CTGCACCCAG CCAAAAGACT 
GGTAAATCCA CGTGAGAGAG AAAGTTTTTA 
CATTACAAGC ATTTTTCAAA CATTTAAACT 
TTAATATAGA CAAGACTAGT TCATGAACAT 
TACCAGTACG TAGACATATA TTTCTTGTAT 
CTACATTTTA GAATATGGCA AAGCTTAAGA 
TAAGTAATAA GTCACTTGTC ATTGTTCTAT 
AAGTAACCTA TGTTTTGAGC TGTTTCCTTT 
AGTGACCCTC TTTTTCTCAT GTGGGTGCAC 
ATATCTTCCC CACACATTCT ACCATGTGGT 
CTACTCACTA GGTTAGTAGA GACTTGTATT 
AGGGTTAAAC CCACGATCAA GTGACTCATA 
AACTTTAAAT GTATGATTTC TATGCAAATT 
TTCTACTTAA AAGAGAGGAT TTCGGAGTAC 
GGCCCTTGCC ACAGCTTGAG ACAGACAGAA 
GTGTCTGTGC AAGTGTGCGT GTGCATATGT 
GGAGGGCTAT AGAGATAGAT GGATAAGAGT 
GACTGCTCTT CTGTTCAAGA TGCTTTTGGC 
CTCTGGAAAA TCACCAATAG CATGGGGCTT 
GCTACTCGAC TTACCCAACT TTATAAAAGG 
CATCTGTGAT ATTTGCTTTG GGATCAGAAA 
TTCTTTGATA TAGGGAGATG TTCACCTTAA 
TGATTTGGTC TTCAATATCT AATCATGAAA 
GGTATATAGT TTGGGAGTTT TGACTGATTA 
AAAAGGTTGA GAGATTCAAA TGCCTTGATA 
TGGAGATGCA TTTTTTGGGT TTTAGCATAC 
AGAATATACA TTGATTCCAG GGTAAATGTC 
TATCCTGAAA GGGGAGAAAT TATCTCACGG 
CATTACACTG TTTGTGGGAC AAGAAGGGAA 
AGACAAGTGA AACTGTCACA TAGAAACATG 
CAAAATGCTG CAGGTGTTTG TAAAGCTTGC 
TATTACTTAC TTTTGATGCT GGATTTTGTC 
ATAAGACACT AGAAAGGCTT TTTGGTTTTT 
ATTTTCACAT AGCTTTGGAG CACTGTAGAG 
AATGCCTATG TGGGGGAAAG AATCACAAGT 
TTTTGGAATA GATATGTTTT AACCAAGTGT 
TTCTAGTCAT TTGTTACAGA ATCAGGCACT 
GTAAGTAGCT GTAGTCTTTC TATAACTTAG 
TGATGGGTAA ATTTTCAACA TTCTTATTTC 
TGTTGACATA CACAGGATTC TTCAATAGCC 
GCCACTTCTT ATTTCCAACT CATCTTATAT 
CATTAAACAT TACTTTTGTT TGTTTTTGAG 
AGTTTTCTAT TGCTCTTGTG TATAATCCAC 
AGGTCATCCT CAGTTTTGCT CCACTTGATC 
TCCTTATCAT GAAAAGTACT CCAGCCTGCA 
CTCTATCGTT GTCTCTATGC CTGCCTTATG 
ACCCACCCCA CCCTCTTCTT CCTCCTTTTT 
TTTCACCTTA ATTTCTACCT GTATAATAAC 
AGACTCCACA ATAGGCCAGC CACTATTCTC 
TGTTAAATGT TGATATCACC CTATTGGGGA 
AAGGGTCTCT GGCAGTACCT AAGAATTAAA 
GTGGTGGCTC ATGCCTGTAA TCCCAGTGCT 
GATCACAAGG TCAGAAGTTC GAGACCAGAC 
CATTTCTACT AAAAATATAA AAATTAGCCG 
TAGTCCCAGC TACTCGGGAG GCTGACTCAG 
AGGCAGAGGT TGCAGTGAGC CAACATCGCG 
CAACAGAGTG AGACTCTGTC TCAAAAAAAA 
AAGGAAACTG AGGTAACACA GATCAACAGG 
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311*001 AGAATATCAT ACACAATTTG CTAAAATTTT ACATATATGT GGAAGTCCTT 
311051 ACAAAACAAA ATAAAAAAAT ATAGACCCAA AGAAGTGACC AAAACAGATA 
311101 GCTTATATAC CTTTTAGACA AAGAAAGGAT AAATTTTTAA AGAAATGATA 
311151 AGACAAAGGG ATTTGGGCTA GGAGCAATAA ATTGTAGGGA AGTAACTAAC 
311201 AGATATATTG GGGAAAACTA GTAGAAGATA AAGGTTATTT TAGCAAGCTT 
311251 GTTTGAACAG ATTCATTTCA GCATTGATTC CCAGTCTCTG GTGATTAGAG 
311301 TGTTCCTCTT TTCTTGGTAC AGGGAGAGCA CCTTTCTCAA ATGAAATTTA 
311351 TGTGTTTTTT TTTTTAATGG CCCATTTTTA GGTAGCAAAG GGGAGGTCAG 
311401 AGACCCCTTC CTGCATCTGC TGTTCCTCAA GTATCTCCAA CTCAAAATAA 
311451 TCAATATGCC CAAATAGCAT ATTTTGAGGT GGCATGTCCT GAACTCCTTC 
311501 AAACTGTATA AGGTAGTTCT ACATCTCCAT TTTACAAATA ATGGCATCAA 
311551 GGCACAAAGG AATAAACTCA CTAAGGGACA CAGTTGTGAA GTCACTCCGC 
311601- TGGGACACAA AGCCAGCTGC CTGCCCCACA CCCTGTTCTC TCTGCCACTC 
311651 TGCTGCCCTT CCCTTCAGTC TTCTCGGACT CTCCAGCTCA TTCTGCTTTC 
311701 CTCTATCTAT GAACTTATAT GGTAGCTGTT TTCAAAACCG TACAGTTTAT 
311751 CATTTCATTG TGACGAGTTC ATGCATGTTA GTTTTTTCTG AGCAATTACA 
311801 TAGCAAGATA TCTTGAAATG ATTTCTATGC CTCATTCTAC TTTCTTTCCA 
311851 AAGCAGTCCT GGCTCCTGGT AAGAACTCAT TAAATACTTT GTGATTGATA 
311901 GATAATACAA ATAATAGAGA TAGGAGGCAG GGGAATTCTG GGCAGAAGAA 
311951 GGCAGGTCCC CGGCAAGGGC CCCAACCTCA AGCCTGAAAC CACAGCCCAA 
312001 AGTGAGAACA TACATACCTG TTTTCCCACT CAAATGTTGC CTTTTCCAAA 
312051 ACCACCCATG GCCTGTCCCA CTCCCAATCC TGTGCCCATA AAAACCCCAG 
312101 AACTCAGCCA GCAGAGAGGA GAAGCAGCTG GATGTGGAAG ACTATGGTTA 
312151 GATGTCAGAG AGAAGTGACT TGACTTCAGA GGGACAGCTT GATGGCATAG 
312201 CTTTGGAGAG GAATCCGGCC GTTCCTCCTC CATCCCCTTT CCAGCTCGCC 
312251 TTCCTGCTGA GAGCCACTTT CATCGGTAAT AAAATCCCCT GCACTTACCA 
312301 TCTCCAATTT GTTCCTGTGA CCTTATTCCT CCTGGATGCC AGACAAGAAT 
312351 TTAGGTGCCA TGAGTGCGGG TACAAAAGGC TGTCACACTG ACCTTCCACT 
312401 GAGCTTTTAA CACTTAAGCC ATCCAGCCAT CCACAGATGG CAAAGTTGAA 
312451 AGGGCACTGT AACACTCCTT CTAGGGCTTC AGGTGTTGCA GGCACCCCCC 
312501 TAGATGCTGC TGTGGGGTCT GCACAAAGTT TTGCT.CCTGC CGGCACCCAA 
312551 AAACACTCAT CCTGGCTCCC GTGTCCACTC ACCTGCTCTC CCCCTCCCAC 
312601 AAAAGGTGGA ACACAACTGG GCCAAGCAAG TGGAGTCTGA CCCTGCCAGC 
312651 ACCAAAGGAG CCAGCTAGCC TGTGAAGGCA TCAGGGAAGT ATCATGCTTC 
312701 ACAAAGACAG CTCTCCTCCC CTCCTATGAT ACAAACAATT GTATTGATTT 
312751 TTGTAATTCA CTCTGACTAT ATAGACCAAG TGCCTTCAAG CACGGTGACA 
312801 TACTAAAAAG TATACATTTC ATGTGACTGA TCTACAAAGT AGCCCTCAGC 
312851 TTAGCTGAGA GTTGACTAGC ATCCTGCAAT GGTTTCTGAG TTGGGCTCCA 
312901 GTGGGTCTGA ATTATTCAGT TTTCTTTTTT TTCTTTTTTT TTCTTTTTTT 
312951 TTTTTTTTGA GACAGGGTCT CACTCTGTTG CCCAGGCTGG AGTACAGTGG 
313001 TGTGATCTTG GCTCACTGCA GCCTCCACCT CCTGGGCTCA AGTGATAATC 
313051 CCGCCTCAGC CTCTCGAATA GCTGAGACTA CAGGCATGCA CTACAACACC 
313101 CCAGGTAATT TTTGTAGTTT TTGTAGAGAC AGAGTTTTAC CGTGTTGTCC 
313151 CGGCTGGCCT CAAATTCCTG GGCTCAAGCC ATCCACATGC CTCAGCCTCC 
313201 TAAAGTGCTG GCATTACACG TGTGAATCAC TATGGGCCAG CCTTCAGTTT 
313251 TCATTTGCAT TGTTCTGAAA CAATTTTTTA ACATTCCATG TTATACAAAC 
313301 ATCACACAAA CAAGAAGATT ACACCATTTG TGATACGTAC GTGGTTATAG 
313351 TTCACATAAT AGGAGACACT ATTCAACTAC CTTAAAATCA GCTTACTGAG 
313401 CACCAGAAAC AACATGATTC ATATATTGGC TTAAAAGTCC AGTATCACTG 
313451 TGATTTTCCT GATTTTTTCT TTTTTTAAAA AAAACCATAT GATGATTATG 
313501 ACTTGAGGTT CTATCCGGGA TTGATACTGG AATAATTCTT GAAGGTTGGT 
313551 GGTTGGGGAC AGAGACCTGG TAATCAGGCA GACCTGATTT GGAATCTCAG 
313601 CTGTACCACT GATTAACGTG GCAATTTTAG CAAGAAACTT AGCTTCTCTG 
313651 AACAACTACT TCTTGATATC TAAAGTGGGA ATAATAACCC CTACTCTGCA 
313701 GGGTTGCTAT AAGAATTAAA TTCAATAATG CATATAAAGG AATTATACTT 
313751 TCTACCACAT GGTAATTGCT CAATAAAATG TTTTTATTAG TGGTAATCAT 
313801 GTATCCATAA GCATAGTAAT TGATATATGC TGACTCACCT GAAAAAATGG 
313851 CAAGTAGAAT TTGGAAATGA CAGCCTCTTT TCTTCCTTTT TTCCTTCTTT 
313901 GCTTCCATCC TTCCTTCCTT TCCTCTTTCT TTCCTTTCTT CTTTTTTCCA 
313951 TCTCTCTCTC CTCCTTTTCT TTCTCTCTTC TTTGTTTCTT TCTGAGGCAT 
314001 AATTTACATA CAATAACATG TATTTATTTT GATGTACAAT TTGATGGATT 
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314051 TTGGTCATTG TATGCAGTCA AGTAAACGTC CTTACACTGA AGCTATAAAA 
314101 TAGTGCCATC ACCCTAAAAC ATTTCCTCAT GTCCTTTTGC AATCAACTCC 
314151 TCACTTCCAT TCTCAACCTC AGGTAACCAC TCATCTGTTT TTGATCACTA 
314201 TAGTTTGGTC ATTTCTAGAA TTCCATATAC ATGGAAACGT AAATTATATA 
314251 GTTTTTTACA TTCGATTTCT TTCACTTAGC ATACTTTTTG. TGATTGACTC 
314301 ATGTTAAGTG - TAATAATATT TTGTTTATTT TTATTGCTAG TAATGTTTCA 
314351 TGGTATGAAT ATATCACAGT TTGTCTATTC ACTAGCTGAT AGACATTTGG 
314401 GTTGTTTTAA TTTTTGGCTT TAGGAATACT GTTGCTATAA ACACTTGATT 
314451 GCAAGCTTTC GTGTGAACTT ACTCTCCTTA CAGGGCTTAG TGTGCCCAGA 
314501 AAAAGGACAC TGGGTTACAA TGGGAAGTAT AGCACATCAG AGCATCTAAG 
314551 TTGTGTCTTC TCCTTGCCTC TAGTGATGAT AATGAGAGTA ATAAAGCCGA 
314601 CTTTCATTTA TCTAGCCTGT TTCCTCCTCT CAGAGAGGCT GCTTTCACAG 
314651 CCCCCACTGT CCCCAAAGAG TCAGGTTAGC TTGTCCCCAT CTCATCACTG 
314701 GTTCATTGAG CTGTGGTAGA CACCTGCCTC AAACTGGACC AAATGGATTT 
314751 TCTACCCTGA GAACGTGAAA TTCAGAAACA GGGTCTTTTA AAATTAATTA 
314801 TGTATGCTTA AACTAGAGGT AAGATAGACT TAGCTAGGAT GGGCAATTTG 
314851 AGTCGAATAC CAAGACAACC GACATCTCCA GTGTGAGGAA AGAGACAGCG 
314901 AGAGTGAGAA AGAGAGAGAG AGAGAGAGAA AGAGAGAGAG AGTGAAACTG 
314951 CAGATAAGCA ATGAAATCCA GAAATAAGAG ACTAGATTTC AAGACAGAGA 
315001 AGTGAGCTAT ATTTATTTCA CCCTCTTTCA GTACTGTAAG ATGCCACAAA 
315051 ATCCTTTAAA CTCTGTGGTT TTGCCAGATG CTCTGAAAGT GTTCCAGGGA 
315101 ACTCTCTCTT TGGTTTGAAC TAGGTTCAGC AGGCTTTTGT TTCTGACAAA 
315151 TAAAATAACC AAATAATCCC TGACTGAGAC AAATCATAAT TGTTATGAAG 
315201 ATTTGTTGAA TCCTAACTAC CTACCTACTT TAATAATAAT TCTATTAATC 
315251 CTTTCTTACA TCGGATCACC AAAACAACCC TGTAAGGAAG GATTTTTCTT 
315301 TGAAGGAACT GAACTAAGAA ATGAGTTCAG TTAGTCGAGA AATTAAATAA 
315351 TTTGTATAAG GTTACATGTT TGGAGGGTAG CAGGCCTGGA GACTAAATTC 
315401 TGAACTGCTT TTCCCCAAGG TCTCTGATGG AAATAGCAGT CCCATGCTGG 
315451 ACCCTGGGTG TGACATGAAG GAGACAGCGT AGGGCTCCAT ACAGCTCCCC 
315501 CAGCTTGGTG ATCTCTCTTT ATATATCTAA GAAAGTGAGT GGAGTAATAG 
315551 GCATTGAGAG GGCTCCTGTC TATGGACTCC ACTTTCTCTG TGAATCTAGA 
315601 TTATTTGCTG TTAATAAGGG AAGATTTGAT AGTGTTGGAT ATTTGAAGGA 
315651 AGTCATTTGT GAATGCTATC AGATGCCAAT TTGGAAATGC AGAAAGATTA 
315701- TAAGCCAGTA TGGTCCTATA AGGACTATAT TTCATGTAAG TAAAGTTGAG 
315751 AGAAATAGAG ACATTTCCCT CTTAAATGTT TGCCTGTTGG .TTTGTTTTTA 
315801 CATTCATTTT GTGGAGGAAA CACATGACTT TAAAAAAATG TAAACTATTT 
315851 CTGTTTCTTC TTATGAGTGA ATCGTGGTGG GAAAGACTTC ATGGACAGAG 
315901 TGAAAATGAA AGAAATGCTT TCTAAAGCAA AACACAGAAA AGGAGAAAGC 
315951 TGGCTAGGAG GTGATAAGGA ATATGAAAAG ACTCCAGGAG TTATAAGAGA 
316001 GAACTCAGAA GTTACAGTGA TTCCCCTGCC TGGTGGGCTT CAGAAGCATT 
316051 TGTAGAATGA GTGCAGCATA AGTTTTAGTC ATGCAATTTG GATGTGAATA 
316101 GAGACTTTGC ACTATCTGGC CTTTCACCAC TTCCTTTTCT TTTCTCAAGT 
316151 TGGGCCTTCT CATCAGAAAC TGAGGATTGT TACACCTCAC AGGCTAGTTG 
316201 TATCAACTAA ATGGGCTGTC CAGGAGTGGT GGCTCATGCC TGTAATCCCA 
316251 GCACTTTGGG AGGCCAAGGT GGGTGGATCA CGAGGTAAGG AGTTTGAGCC 
316301 CAGCCTGACC AACATGGCGA AAACCTGTCT CTACTAAAAA TACAAAAATT 
316351 AGCTGGGCAT GGTTGTGTGT GCCAGTAATC CCAGCTACTC AGGAGGCTGA 
316401 GGCAGGAGAA TTGCTTGAAC CCGGGAGGCA GAGGTTGCAG TGAGCAAGAC 
316451 TGCGCCATTG CACTCCAGCC TGGGTGACAG AGCAAGACTC TGTCTCAAAA 
316501 AAACAAAACA AAAAAACAAA AAACAAACAA AAATCCTAAA TGGGTTAATG 
316551 CAAGTAAAAT ACTAAGCCTG CTGATATAGA GCAGGTGCTC AATAAATGTC 
316601 AGGTTTCCTA TCACCACTTC ATACTGTTCT TTGAGACATT CATTGGTTTT 
316651 TCAACTGTTT GCCTTTAGGA GGATTTCTTC TGGTCTGGAC AATCGAAGAG 
316701 GTTCACAGTG ATGCACGTCA TTTCCTGGCC TACAGTGAAG TGACAATTGT 
316751 ACACTCAAAC ACAAATCTAG ATGTAAATAT CACAGACTTA TTTGTAGTTA 
316801 TTAATTTAAA TTGGTTAAAA CACCTTCTTA GTTACAGATA CCAAAGAGCT 
316851 TAAGTATGTG GTTTAAAGCA TTAGACGCAA AAGGTATTAC AGACGCATTC 
316901 AGGGATGATG GGTATCCCCG GATCCTAAGG CCTAGTGAAT GGTCTCTTCT 
316951 GTAGAAGACA GAGAGGATAA GAGTGGGGAC TAACAATTCA ATCTTATACC 
317001 AAAGATAAAT AAGTATGTAT AGATATCAAC ATAGGTATAT AGGTATAGAT 
317051 AAATAGGTAT AAATATATAT GGATATAGTT CTAAAGCATA CTACTTTACA 
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317101 
317151 
317201 
317251 
317301 
317351 
317401 
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CTCTGCATAC 
TATTCTTTTG 
TATTAGCAGG 
AAAGTTTCAT 
GTCAGAGTAG 
CAATTTAAAA 
CTCTGAAAGT 
TATGTAGCTG 
TTCCAAGATC 
GATTATTTGT 
AGGATATGTC 
ATAGCTCAAA 
ACACGAGCAG 
CCAGGCATAC 
TTCCAGATGA 
TATTCCAGCA 
GATCTGCAGA 
GTTGTCTTTG 
TAGAAATAAG 
TCAGCCTTTT 
ACCCTGATGT 
TTTAAGGGTC 
CACATGTGGA 
CTCTACACAT 
ACAAACTTGG 
GAGAGCCTAG 
TCTCGATTTT 
TACTATACAG 
TTCCCTCTCC 
CTTTACATAT 
GTACTAATTC 
AGTTTTGTAA 
AGATGTTGTC 
TGAGGAATGC 
GATTAGAAAT 
CTGTTCAAGG 
AGAAGAAAAG 
AAGTTGAGAA 
ACAGTGGGAT 
TTCTTAATTC 
CTCTCAACTT 
TAACGCCTGT 
GGTCAGGAGT 
CTAAAAATAC 
GCTACTCAGG 
GTGGCAGTGA 
GTGAAACTCT 
ACGAGACCAA 
GGAAGGAAGG 
CAAAATAAAA 
GCTAAGACCT 
CCCCTTTTAG 
GGACAGGTGA 
AAAAGATCTT 
CAGCATAATT 
AAGTGCATAA 
GGGTTTGTGT 
TCTCCAAGGT 
AACACATTCT 
ATGAAATAAT 
CATATTGAGT 



AAAATATATT 
TGCAGAGAGA 
AATTCAGGCC 
AATTGGGTCC 
GCTTTTGACA 
ACGGTTGGAA 
ACAGCAGATA 
ATATCAGGGA 
TCAACTGAGC 
AAAGATGCTG 
TCCATCAGCT 
CTTCTCCTGC 
CCCAGGCATG 
ATTGTCTGTG 
TGAGTTTTCC 
TCTACCTTTA 
TGCCTGTGGA 
AACATAGCAA 
AAAATAGCAT 
CATGAAACAT 
TAGTTATATA 
TTTTCCAGTC 
CTTCTCATTG 
ACTTTAGGAC 
TGATATAGAA 
TGTGGTTCAG 
CTCATCTGTA 
GAAAGTGCTA 
TGGCACAATG 
CATGAGTTAC 
GTCATCACCA 
AGATGTGTAA 
CTCATCAAGG 
CTGCACCTCT 
ATAAATTATG 
AGTTGCATTT 
GAGGAAGGGG 
GAGGGAAAGT 
TATTATTTCC 
TGTCTTCATT 
GGGAATAAGA 
AATCCCAGCA 
TCGAGACCAG 
AAAAATTAGC 
AGGCTGAGGC 
GCCAAGATTG 
ACCTCAAAAA 
ATATATCTGA 
GAGGAAAAAG 
GCTTTGGAAT 
TTTCTCGTGT 
TTAGGTAAAC 
ATTAATATAC 
GAGAATATCT 
ATTCCCCCAA 
AATACGGGAA 
TATCCAAGGC 
TAAGTCTGAG 
ACCTCCATCA 
TTGCTCGTGG 
GATGAACCTG 
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AGTTGATAGC 
CTTTATAAAA 
CTGAGAAAGA 
TCCAGTGTTG 
TTTGCCCATT 
AGAAATCAGT 
AAAACAAAAC 
AGCCCAAAAA 
TTCTGAACCA 
TCTAATGATC 
CTTAACACAC 
AAGTTTGATC 
ATGGCATCAC 
AGCTTCAAAT 
ATAAATGTTG 
AAGGCAACTA 
GATGAGACTC 
TGAAGAGTTA 
AGTAAGGCTG 
AACTCAGATG 
ACCAGGGGAC 
TGCAAGACCC 
TTACTAAACT 
TTATTGATTT 
ACTGCATTTT 
ACAAACCTGA 
AAACTGGGCA 
AATACCGCAG 
CTATGGATGC 
CCAGAGTTTT 
AAGCATGAGC 
ACTTCCCAGG 
ATCAGGGACT 
GACCAGCCTC 
TAAAATGCAG 
TACTGCTTGG 
GAGGGAAGGG 
TGCTCCATAA 
CATTATCACA 
CTCACTTCAC 
AACTGGACCT 
CTTTGGGAGG 
CCTGACCAAC 
CAAGTGTGGT 
AGGAGAATTG 
CACCACTGCA 
AAAAAAAAAA 
ATATCTGAGG 
AAAGAAAGAA 
TGCTATAGAA 
CTGTATTTGG 
GTATTTTTAT 
TCATTGAAAA 
GGCGCAGTGT 
TTTATATTTC 
ACCTGGCTAC 
ATCTTTCCTC 
GAGTTTTCTA 
TAATGTGAAA 
CTAGCTGATA 
GACTAGATGC 
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TTTTTATATT 
TCAATGTGAC 
CAAGCTTATG 
TGCTAAATAC 
TATTCTTGCA 
GGCTATTTTT 
AGAACAAATA 
CCACCTTAGA 
CTCCCAGTCA 
AGTAGAAAGT 
GTTGGCTCCC 
CACAAAGTGA 
CCCAGAACAG 
TGCTTCTTGT 
TCTCTTGCTA 
TCAAAACAAT 
ATAATTACTC 
TTTTTATTTG 
CTCTTGACTT 
ATCTTCTTCT 
CACGGTGTCA 
AACCAATTTC 
AACCATCCCT 
CATGGACGTT 
CTGCTTTGGT 
TGAGATGCAC 
AGTTGTTATA 
GCACAGATGA 
AAACAAACAC 
CTCGTTCATG 
ATATTTAGAA 
TGCCAGTAGC 
GCTTGTTGTC 
AAACACTGCT 
ATCATTGACT 
AATACCTTTT 
AGGAGAAAAA 
TTTACAAGGC 
TGAAACAATC 
CTTGTCCTTT 
TTACAGCTGG 
CCGAGGCAGG 
ATGGTGAAAC 
GGTGGGCACC 
CTTGAAACCG 
CTCCAGCCTG 
AAAAAAGAAA 
CTTTAGAAAA 
AGAAAGAAAG 
AGGATTAGAT 
GQAAGAGAAG 
ATTGAGAACC 
TGAAATGACT 
TTTTGAAACT 
AGTTAAGTAA 
TAGGGTTTGA 
CCCTTCGCTG 
GAAAACCTCA 
AGAATCTGTG 
AAAAGCACTT 
ATGAAGTACC 



TAGCCTTGAA 
TCCAAGAAAG 
GAGCAGGTAA 
AGACAGAACT 
CTAAAATGAG 
GAACACCATC 
TATGGAAAAA 
GTTACACATT 
AATCAAACTC 
GAAGACACAC 
AAACCTTACA 
TCCTGAACTA 
AACACTTCCT 
CCTGTATTGT 
GAAAATGTTA 
AATCTTTAAA 
TGATGAAATT 
GGAAAAAGTG 
TCGTACAGCT 
GAATTAGTGA 
GCCTTGTTAT 
CTTCACCTTT 
TCCTGCTCAT 
CATTTGTCTC 
GCACCCCTTA 
AACTTGGAGG 
AGGATTGTGA 
AAGGAAATCT 
ATTGTTTTAC 
TATGAAGGGA 
CTGGATTTTG 
AGGCCTGTGT 
ACGTTCCTGT 
GTAGAGGAAA 
GAACATTTGA 
AAAGTGAAGG 
GAAGAGGAGG 
TCTTFTTTTC 
TTCATTACTT 
TATATATGTT 
GCACGGTGGC 
TGGATCATGA 
CCTGTCTCTA 
TGTAATCCCA 
GAAGGCAGAG 
GGCAACAAGA 
CTAGAGCTTT 
ACAGAAGAAA 
AAGAAACAGA 
TAGATGACTA 
GCATGAATTT 
TCTTGCATAA 
TCAGAGCTCA 
TTTAAGAAAT 
AATTGACTCT 
ATCGAAAGCA 
TCCTTCTCTT 
AATTCAGTGG 
GTCTAATCTG 
GTAGCTAACA 
TAATCTCCCC 



WO 02/086113 




'CT/GB02/01887 



- 106/219 



320151 ACCAACCCTG TGATGTACAC ACTCTTATGA CACTAATTTT ATAGAGAAGG 
320201 AAACTGATTT CAGAGAGCTT AAGTGATTAG AAGCAAATCC ATGACTAGAC 
320251 CTTGGTGTCC TTAACCCTCA GGGCAATGCT TTGGCACTCT ACAATCTTAC 
320301 CTCCTAGAAA CACCTTCAAA ACCACCCTAT GATTAGAACT GAATTTTAGC 
320351 ATCTACAAAA ATTCACAGTT CACTTTTAGT TCTCTATTTT CCTGGGTTCT 
320401 TCCCAGGAAT TTAAGTTATT TTGAGACCAG AGAAGCTTAA GGCATTGACA 
320451 TTCCACATGC CCTCAGGATT TAACTTAAGC TCCAGGCTGT TTCAATACCA 
320501 AAGATTTTGC CTCAAACTTT GCCACCACTG AGAACTTGGG GATGTGAAAA 
320551 CCACTGAGGC AGCAATGGCA AAGAGAAGAG TGGGAAAGGC CTTTCCAGGA 
320601 TCCATGTGCA GAATTAGATT TGTGTACCAC CTGTGATAAC TGGGAAAGAT 
320651 CTTTTTTTTC TGTTTTTTTT TTTTGTTCCT TTTTTTTTCC ACGTTGGAGG 
320701 CAGGAATAAA GCAGGTGAGG GTTCCATCAG AGTATCTGCC CTAATTTCCT 
320751 GTAACAGCAA CAGGAGTTAC CCATTACTCC TATTGCCAAT AGATATCATG 
320801 GCTTCACTCC TAAACCATAC TCAGAGCATT CACTTGTTGA TGTTAGTAAG 
320851 GAAAAGCAAG TTGTGCATAA GCAGTAGTAT ACAGCATGCT TCAGTACAGT 
320901 TTGTAGCTAT TTGCTTAACT GAAATCTAGG TACAAGCCTC TTCAAATAAC 
320951 AGATCTTCTT TTTCTGTATG TTCCTTAAAG AGAAAGAGGA AAGAGAAAAG 
321001 AAACATGCCT GGAGTGAGCA TTTATCACTA AGACAATTTC CAATTTCATC 
321051 TCACGATCAG GTTTACTTCC CAATATTCTA CAACCGTACC ATTTTATCAT 
321101 GGGCATAAAC ATTGAGAATA ATGACTCTTG TGTCTTTTCA GAGCCTAATT 
321151 TGTGATGAGA GAATTTTAAT GTTGGAAACA AACTTTAAAG GTACCGAAGT 
321201 CCAGACGTCT AATATACCCT AGAATCTCTC TGCAGCTCGT TGGCAACAGA 
321251 CATCTGGCCC AGCTGTGATG CTCTAATAAG TCCAGGCAAG AGGACCATAG 
321301 CACTTTAGAA AGACATGAAG CTTACTGAAG ATATGACTTA CAAATAGAGC 
321351 TTTTGGTACA GTGCAGTCTG AAACTGATTT TCCTAAAAAT AAAATGCTCA 
321401 CGTAGGTCTC AAATAAAGGT GCCTCGCTCC TCAGATAGAT AGAAAATAGT 
321451 ACAACACCCA ACCTAAATAG TCAAAATAAA AATATTGTGT ATCCAAAAAA 
321501 AAGTTTTAGA AATGTATTTA GTTACCACTT GGAGCCAGAG GTAGTTGCCA 
321551 ACCCAATTTC ATTATAAGTG AAATTTCTAG GGTATTAAAA ATGGCTCTCC 
321601 TCTTCTTCTT CCTCTCCCTT TTCCTTCTTC TTTTTCTGTT GATTTTTTAT 
321651 CATTTAATTT TGATCCACAA TTCTGTATAA CATCTACATC TCATATCAAC 
321701 CTTTTTTNTT TTCTAGAATT ATTCTACTCC CTGGGTCAGT CCCCAAGATG 
321751 TTTCACTTCA TTATGGATCT CCTTAGCGCT GGTGCTAAGG TTAAAACATT 
321801 AATTCTCCGC ATGCTGAGAG TTACTTAGCT CCATGGTGAA GTCTCTATCT 
321851 GCCAAAAGTC TCTGTTCAAA TAAATATGTA ACTGGTGATT CTACAGCATG 
321901 CATTAGAATC ATGCAGAGGG TTTGTTAAAA CACAGCCTGC TGGGCCCCAC 
321951 CCTCCTAAGT CTCTGATTCA GTGGATCTGA GGGAGCCCAA GAATTTGCAT 
322001 TTTTAGCATC TTCCCAGGTG CTGCAGATGT TACTGGCTCA GCAACAACAC 
322051 TTAGAACAAC TGTTCTAGAA AGAGAGTCCC TAGAACTGTT GCATTCTAGT 
3.22101 TAAAGATGGT GGGAATGGGA GGAAGACATT GCTCTATACC AGGCTGTTAT 
322151 GGGCAAATTC GGGAGATAAT TGTGATGAGG TTGAAAGGTT TGAATTTAAT 
322201 CTCCAAATTT CAGTCAGAGA GTAAGATTCA TTAAGGCTTT TTTCCATCCC 
322251 CCTGCAAAGT CCATTCTTCC AGCCAGCTCT TGCTCCAGCC CTGCTGTGAT 
322301 GAGACTCTGC TGTTTTCTGT TTTAGCTACT TCTGCAGCTC TCCCAACCCT 
322351 GGAAAAGATG AAGAAAGATG ATGATTTTGT TCTCAGCTCT TAGTCTTTAG 
322401 CCTTTGGACA GCCATCAAAA GTACATTTTA AGGAAAAGGG GGAAAATATG 
322451 TCTACTAGGA GAAATGCCTT GTATTAGACT ATTATATCAG TGATTTGCTT 
322501 TTACTTAATG TAGTATTGAG GAGAAAATGC CTCTCTTTTA TTATTCAAAT 
322551 GGTTTGCTTT TACTTTACCA AATTACAAGA AAGCATACCA AATGCAGGCT 
322601 TCTAAATAGT TTAAACTGGT TTACCACATT AAACTCAAAA GAGAGTATTT 
322651 CATTAAAAAA TATAAATGGA CTGGAGCTGT TTTCTGACAA TAGACTGTTG 
322701 GGCTTTTTTT AAACTTCTAC TTTTGTTTTA TGAACCGAAC TACTGGTGTT 
322751 TAATACTCAA AGATGCTCAG CTTTTAGGTA ^ ATATTTGTTG TAGGAATTCT 
322801 TAAAAGCCAC AGATACTGCT AATGGGGAAA ^ AAAC AACTTT TCTTATGTTG 
322851 AGGTTATAGC AACTTTGGTG GATAACAGTT AATACAATTA GTTTTCCATC 
322901 AGTTTCTCTA AATGCATTTT TTTTTTCTAA CTAACTTTAG GGTATTCTGT 
322951 ATCCTTGGCA TTGATTTAGT GAGATCTTCA TTACATGGCA TTTCACTTTA 
323001 GTTTCCATTG CAGATTGATA TCACAGGTAA TAACTGAATC TTTAGGGTTT 
323051 GAAGAAGCTG ACCTTTCTTT AAGAGATCTA ATCATTTCAC GGACAAGTTC 
323101 CAGTGGTATT TCTATTAAGA AATCAGGGTA TGGCTGGGCA CAGTGGCTCA 
323151 TGCCTGTAAT CCCAACACTT TGGGAGGCCG AGGCGGGAGG ATCACCTGAT 
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323201 GTCAGGAGTT CGAGACCAGC 
323251 TAAAAATATA AAATTAGCCA 
323301 TACTCGGGAG GCTGAGACAG 
323351 TGCAGTGAGC TGAGATCACG 
323401 TGAGACTCCG TCTCAAAAAA 
323451 ATAATCAGGA CTTTAAGGAA 
323501 TACCAGTTTA CATTATGGTT 
323551 TCTTGAAATG AAAACACATT 
323601 CAGCTGCCTC AGATACATGA 
323651 AAAGTTTAAC GGGAAAGCTG 
323701 AGCTGCAGAG CGGAGGCTCA 
323751 ACAAAAGGAA TGTGGAGGAC 
323801 TGTTTCTATA GGCAGTCATT 
323851 CAGTCATTTC ATTTGTAGAG 
323901 AAGATTTTTC AAATTTGGTT 
323951 TGAGCTATTC ATTTTTTTTT 
324001 CTCTATCACC CAGGCTGGAA 
324051 TCTGCCTCCC AGGTTCAGGT 
324101 GGGATTATAG GTACCCACCA 
324151 AGAGATGGTG TTTTGCCATG 
324201 CAGGTAGTCC ACCTGCCTCG 
324251 AGCCACCGTG CAGGGCCTAT 
324301 TAACTTTTGG GCCATGTTGG 
324351 TGTTTTCCCT TCTATTTTCA 
324401 TCTGAATTAG CAATTTTTCA 
324451 TCGTTCCAGG ATGCTTGCTC 
324501 CTTGTTTCTC CAGCTGTCCT 
324551 TGCTAGAGAT GCAGATCCCA 
324601 GCATTTTAGC AAGAGTTCCA 
324651 AATATTGGTA AATATTATGG 
324701 TTAGCTCTGC CACTATCTAA 
324751 TTTCAAACTG TGTTTCATGG 
324801 GCTGGGTAGT AAGGTCAGCA 
324851 TTTATTAGTT TTTATCTGTT 
324901 AAGGTTTGCC TGTGAAAGAG 
324951 AAAGTCATTT CCTGATCTGA 
325001 TAGGAATTGT TTCCTTAGCG 
325051 GTGGCAGGTT GACATTTGTC 
325101 CCTTTAGGAG GTTCCTTGAT 
325151 CTTCTCACTG TTTTCCTCAT 
325201 CTGTCACTAC AGAAGAACTG 
325251 TTATTTTATC AGTGTTGGTT 
325301 TTTGGCCAGT GCCATTCACT 
325351 GTTCCGAGGT CTGAGATACC 
325401 TCCAGTCCTG CACACCCTGG 
325451 ATTTTCAGTT GACAGTTTCT 
325501 ACAAAATTGT TTGCATTCTA 
325551 TCTAAATCCA GTTCCATCTT 
325601 ATTCATAGCA TGGAGCATCA 
325651 TGCATTGCCC TGTTACTTAT 
325701 TTTCTCCAAA GAACTCAGGC 
325751 AATATACCCT TACTCATTCA 
325801 TTCCTCTTCA TGTATTGCGA 
325851 AAATCTTCCC AGTAGCACGT 
325901 AAAAAGAAAA TTCTACTGAA 
325951 ATCAATCTCA TCAGTAGCTC 
326001 AGCAGATAAT CATCCCCTAA 
326051 AAATAAAATT AGCCTCTAAA 
326101 TTTATCTCAA GGGGATTTAT 
326151 TCTATTTAGG CCAACACACA 
326201 GAATGAGGAG TGAGTAAATG 



CTGGCCAACA TGGTGAAACC CCATCTCTGC 
GGTGTAGTGT CACATGCCTG TAGTCCCAGC 
GAGAATTGCT TGAACCTGGG AGACGGAGGC 
CCACTGCACT CCAGCCTGGG TGATATACAG 
AAAAAAAAAA TTGTCTGTCT TCATGGTAAG 
GTGTGTAAGA CATTTTACTC ATATTTTTAT 
TCTAAGGCCA TTATGTGAAT CCAAATACCA 
AAGCTATTTT TAATCTGCAA ATTATTTGAT 
AGAATCTGAT GGCAAGGGCA CAGAATAGCT 
GTACTATCCA AGTCAGTGAG GTGAACACCC 
GAAATGGTGA GTGCCAAGAT GAATCAAGCC 
CTCTGAGGTT TAGCTTCATA TATTGGGCAA 
GGCCTTTCGT CTTCGTCTTC TGCACTGATT 
TCTTCCTCAA CATATTGTTT AAAAAAATTC 
TTAGGATTTC AAAAACAATC AGGACTAGTA 
TTTTTTTTTT TTTTTTTAGA TAGAGTCCTG 
TGCAGTGGCC AATTTCAGTT TACTGCAACC 
GATTCTCCTG CCTCAACATC CTGAATAGTT 
CCACGCCTGG CTAATTTTTG TGTTTTTAGT 
TTGGCCAGGC TGGTCTCAAA CTCCCGGCCT 
GCCTCCCAAC GTGCTAGGAT TACAGGCATG 
TCATTGTTTT TTTGTTAATT CTCTAAGGAT 
ACTGAGAGAT TCTGTCCTTT GCTCTGCTTA 
GGTAGTGATT AAGACTGTGA AGACAGAAGG 
GGCAGCTGAC TTAATAATTT AGACTAAAAT 
ACCTCCTGCA TGTGGTGTCC TAGAAAGCCT 
CCCCGGCAGC ATTGGCATCA CCTGGGAGAT 
GGCCCTCGCA GATCCATGGA GTCATCACCT 
GGTGATTCGT GTGCACATTA CAGTTTGAAA 
TCATCATAGC GGGTTATCCA ACTACTGGTT 
TTGTGTGACC CTTTGCCAGT CAGCTCAGCT 
AGCTGTATCC ATTCAGTTTA TTACTCAGAG 
GCTCCCATCT GAGCCCTGAG CGACTCCAGT 
AGGCTTTAAC TTAAGACACT GTTTGAAGAA 
ATCGAAGGCC TACTGTCTCT GATGAGTCTT 
ATTTTCTATG ATTATAATAT CATTCATTGA 
AAAAGGAAAA ATCTCTTCAT TTCTATACAT 
AGGTACTGAG GCTGACAGAC AGTTTAGCTA 
CAAAATAGAG AAACTTTGCC AGCCACTGAA 
CTTAGCGACT TAATCCAGGC CTCACCCCAT 
TTTACGAGAC CTCACTTTAC TAGCTTTTAT 
ATTTGGGAGC AAACTGACCC TGAGCTATCC 
TTGCATACAT TGATCTAAAA AGGCAATTGG 
CAATGATATT CTACATGTGA CAAAGACATT 
TAACCTAAAG AAGGCCAAGC ATCTCAATTA 
GCAGCTGAGT CCCCAAGTCA GAGTCCCTAA 
TAGGAAAATA GGACACATTA TTAGCCCAGT 
TCCTCAAACA ACAGCAAATG TTCTTCTCAG 
TCCACTTTTA ATGTTTTCCA TCCACATAGC 
TTACAAATTT ATTGGCTTTA GATGGCATAT 
AGTTGAATGA GTTGTTTTAG AAGTCTACAA 
CCTGTGGGAT TTACATGAGG TTATTTCACA 
TGGGGTTGTT ATAAAGATAC TCACApCTTT 
ATATTCTGCT TTCAAACACA ACAACCAAGA 
ACAACTGACC AATCTTCTTT CCTTAGAAAA 
ATGAAATTCA ATTTCCCTGG TCAGGAACAA 
TAATATTTAT GGATGATAGA AATGCAGAGC 
ACTATCATCA AGTTCCTTGA AACTTTTTTC 
GGAGTAATGG TAAGTGTGTT TTTCTCCATT 
TCAGAGGCAC TCAATAGATA TCTGGTGAAT 
AATAAAGTAA AATACAGATG ACCCTCAATT 
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326251 TGGACATAAT GCATTCCTAG AGACTGCTTC ATAAAGGATA TTGCTTAAAG 
326301 GCAAAACACC GACTGCAAGA ATGCCACCAC TTTTGTCTGT AAAATTCATT 
326351 ACAAATTGAC AGCAGATAAA GTAGCATATA TATCTCTTCA TTAGGTAACA 
326401 GCAAATAAAT GGAAAGGCAA AAGAGGGCCA CAATGAAGCA TGAACAATGT 
326451 ATAGAATTCT AGTAAAAGAT GATTGCATAA GGACATTTTT ACCTTGCTTC 
326501 TCTTTATGAA AGTGTCCATC AATATGAAGA AAAAGAAGGG AAGAACCACC 
326551 ATCTTCTCCA TGACAGOATG CCTAAAGCTA GTCTAATCTT CTCTCTGTCT 
326601 TGAAAGGTTT TATGGATCCT GTGAGTCTGC TTATCACAGT CATTTGAAGA 
326651 GAGAAACAAC AACTGCCAAT CATGAAACAG AGGATGTGTG AGATATGGGC 
326701 TTGCTGTGAA CCAAGATGAT CCAAAGGGGA TGCTGAAAGC TTAGCAGAAC 
326751 CAAATACTCA AATAGAAAAA ATGTGGCTGA TTTGCTTCAA CAGCTTTCCG 
326801 AGATAATATA CCAAAGTGGA AACTGATAGC TTCTCCAGAT CATACTTGCT 
326851 CATTCTGTCC CCGATTCTTG GGCAGAACTC TGGTCTAGAG AGAACTTTCC 
326901 TTATTCCAAG AAGCACCGTG AGCAGAAGAC CTAAAACAAG ACCCATAGAG 
326951 TAAAGGGACA TGGGAAACAA AATGGCAAAT CTGACATCAG AAAAAAATGA 
327001 AGAAGAAAAT AAATAAAAAA TATAGCCTTC ATACTTTTAA GTAAAATGAA 
327051 TGTAAGCTAG TTGCAAAAGA CATAAAATTA ATAAAAGATA ACAACTTAAT 
327101 AAAAGATATC AAATCTAATC TGTCTCTCTC ACATGCACAG ACACACACAC 
327151 ACACACACAC ACACACACAC ACACACACAC AAAGTGTTGA AAGGATAATC 
327201 AGGCAGGCCG ATTTTAAGAA ATAAATGTAA AAATAAAGTC ATCAAAAACT 
327251 GTCACCGCAT TAAGAACCTA AAGGATTAAG AAAGACAACA TAGAAAATAG 
327301 CATCCATGAG CATAGAAGGC TTGAAATAAT CACAGAAAAA GAAATGAAAC 
327351 AAAAAAATTA GAAAGCGATT AGATACAATT TTATAGATAT GGAAGTCAGA 
327401 CATAGAAGAT ACAAATTATG ATATAAATTA TAGTCTAAAA AGAGAAAAAT 
327451 ATAACACAGG AAAATTTAAA ATATTTGTTA AAATAACATC TCCTGAAATC 
327501 AACTAAAACC AGAATCTTAA TGTTAAAAAA AGGTCTATTG TGTTTTAGGA 
327551 AAATTTTATG XGAAACATCT CCTGATGAAT AGAAAGTCTG TGGCCATCCA 
327601 AGCAGAAAAA TCAAGCTAGT CTAAAATTCT TCATGGCAGA ATTGAATGCC 
327651 AGAACACAGA GGAAAAATTG TATAATTATA GATACTATTA TCCAAGGTTT 
327701 TTCTTTTTTT TTTTTTTTGA GATGGAGTCT TGCTCTGTCG CCCAGGCTGG 
327751 AGTGCAGTGG CGCAATCTCG GCTCACTGCA AGCTCCGCTT CCCGGGTTCA 
327801 TGCCATTCTC CTGCCTCAGC CTCCCGAGTA GCTGGGACTA CAGGCGCCCG 
327851 CCACCGCGCC CGGCTAATTT TTTGTATTTT TTTTTTTTTG AGACGGAGTC 
327901 TCGCTCTGTC ACCCAGGCTG GAGTGCAGTG GCGGGATCTC GGCTCACTGC 
327951 AAGCTCCGCC TCCCGGGTTC ACGCCATTCT CCTGCCTCAG CCTCCCAAGT 
328001 AGCTGGGACT ACAGGCGCCC GCCACTACGC CCGGCTAATT TTTTTTTTTG 
328051 TATTTTTAGT AGAGACGGGG TTTCACCGTT TTAGCCGGGA TGGTCTCGAT 
328101 CTCCTGACCT CATGATCCGC CCGCCTCGGC CTCCCAAAGT GCTGGGATTA 
328151 CAGGCGTGAG CCACCGCGCC CGGCCTATCC AAGGTTTTTA TACCCAGTCA 
328201 AGCTGTTCTT CAATGTAAAG GCTATATACT AATAAGCAAT CTAGGCTATA 
328251 ACACCCATGA TCCTTCAAAA AACAAACAAA CAAAAAAACC AGCTGATGAA 
328301 AGAACCTAAC CATAGAAGAT AAATCAAAGA ACTCTGGGTT AGAGAACTCA 
328351 TTGAATTGGT ATTGAATCTA TATATACTTA GAGATAGGAC TAAAAAAACA 
328401 CTCTGGGAAT TATGATAATA GAAATATAAA TATTACAAAT GTATTACTGC 
328451 ATCTTTATAT TTTATTTTTC AATAATTATA TAATCAGCAA CTCGGGGAAG 
328501 CAGTTAAGAC TTTAAGTTTG GGGAGAAATG ATGCCTCTGA TTGGTCTTTT 
328551 TGGGGTGTAT TAATCCATTC TTAAGCTGCT GATAAAGACA TAACAGACTG 
328601 GGCAATTTAC AAAATAAAGA GAGGTTTAAT AGACTTACAG TTCCACGTGG 
328651 CTGGGGAGGC CTCACAATCA CGGCAGAAGG CAAGGAGGAG CAAGTCACGT 
328701 CTTACATGGA TGGCAGCAGG CAAAGAGAGA GAACTTGTGC AGGGAAACTC 
328751 CTCTTTGTAA AACTGTCAGA TCTCCTGAGA CCCATTCACT CTCATGAGAA 
328801 CAGCATGGAA AAAACTTGCC ACCATGATTC AATTACCTCC CGCTGAGTTC 
328851 CTCCTACAAC ACATGGGATT TCAAGATGAG ATTTGAGTAG GCACATGCAG 
328901 CCAAACCATA TTGGGGGTGT CTTAGGTTAG GTTCTCTCGA AACAGATCCT 
328951 GAGACAGGAA TTTGGGTACC ATGGTTATTG AGTGAGTGCT CCTTAGAGGA 
329001 AGGATGTGGA TAAACAGGAT GGGGAGACGT TAAATGAAGA TGTGGTCTCA 
329051 GCTTGAGTCC AGCTTCAGTC TGATCCCATA GAGAGTTCTG GAGCATAAAT 
329101 TGAACCACAG TGCTAGCCCC ATTTTGAGGT GAGGGGCTGG ACTTTTGTTC 
329151 CCTCCTTTCA ATCAGTCATT GACTGCAGGC TGTTCAATAG GGAAGGGATA 
329201 AATAGGGGTG TAACTTTAGA GCTAGATGGC TCGACTTCCA TGCTGGGTAA 
329251 TTCCCTGAAA AAGAGACAGC TGGGGGCTTT AGCACCAGCA CCCACAGCAG 



WO 02/086113^^ tffc PCT/GB02/01887 

109/219 ^ } . 

329301 CAGGGGTCTG GGTGCATCAG TCTGTGAAAG CGAATCTTGG CAGTTTCCCA 
329351 ATGGTGTCTC CTGTAGAGCA TTGATAGCTC TGGTTGATGT TAGCTTTCCC 
329401 TGTCACTGAC TGGTCCTTTA TTATCCTTTA TTATGGCTTA TAAATGTCAG 
329451 TTTTCAGTGT CTGAATGCAT TTCACTGAGA GAAGGAAAGA AAAGAGGCGC 
329501 TGTAGAGGGA AAAGGAAGTG GGGAAAAGCC TAGACTTTTT TTAAGGAGAG 
329551 GAAATGTGCC AGTGAGCTTT GTGGAAGTTG TTTCAGCATG GAGCAGTTAT 
329601 ACATGTTGCC AGATGTGGAG TCTCAACAAT CTCAGAGGGT CAGCCTCTTT 
329651 AGCTATGAAG TGTTGCTTTA TGCCCTCCAG AAATTATCCC TCTCTCCCCT 
3297 01 CCACCTGCTG CTCATCCCAT CCCCCTTTTC TCTCTACTCA CCCTAATGAT 
329751 CCATCTCACA AGAGATGGCA TGTCTCAAAT GGCAAGAACA GGAAGTGACA 
329801 AACATAGACA GTGTTGGTAA TATCGAAATG AAAAGAAATA ATTGTCTCTG 
329851 ATTAAGATGA GACTAGAAAG ACATTGACAT TGCTCTTTGT AGCATGTACT 
329901 AGGGAGCACC CAAATCTCTA AGAGAGGACT AAAGTGTGGC ATTTTGCAAA 
329951 CTTACTTTAC AATTGTACCC ATTGTCTGTT GACTATTCAG TGGGAATAAC 
330001 ATTCTACAAA AACATTCTTT AGCAAATGTG GATGTAGGCA ATGAGATCCA 
330051 GGGGGTTTTC AAGAAATACA GGTGAAGATG ACATTTGTGC TTTAGCAATA 
330101 ACTCTTAAAG AAATGTGGAG GATGGGTTAG AGAGGAAACA GAATGTCAGG 
330151 GAGGCCAGTG AGGGCTATTT CAGTAGTCTA AGTAAAAATA TTAAAGTCCT 
330201 GAACTAATCT TAGTAGTTAT TAAAGCAACC TCCCAGATAA CGCAGGATCC 
330251 CATATCCCTG ACAGAAGAGC CACTGATGCT TCTGTGTCCA TGAAACAGCT 
330301 AATAAAAACT GATGTTTGAG CCTAAGCATT AAGAATGTTT TCCTGATAGG 
330351 GAGCTAAAAT AATCTTTGAA AATCACACCC AAAATATTAG TCCTGCCCTT 
330401 TGAAGCTATG CAGAACAGGT TGCTTGTTCA CTAAGAAAAA GATGTATATA 
330451 TCAAGTGCTG TCTTACAAGT GGGAATACAA AGGTGAGGAA GGCATAGCCC 
330501 TAGCTCTCAA GGAGCTGAAT ATCTAGTGAG AAGAAAAAGA CATACTCAAG 
330551 TTTTGTTGCA ATAAATTTCA GCAGAGGAAA GCTATAGGGA TTTTAGAGGA 
330601 AGAGGTGAAT GACTGTCCAG GCATGCTAAG AGGATTCATG GTGCCTTTAT 
330651 GAAATAATCC TTCTGATATA GGAGGGAAGG AATCAAGTCC ATCCTACCCC 
330701 TACCTCCTGT TTGTTTGTTT CTCAGTTTGC CCAGTTCTTG AACTTGTTCT 
330751 TTTATTACAT GAATCTAAAT TTCATTGAAT CCCATTCCTT CTTCCTCTGG 
330801 AGATTTTGTA CTTTTCCAAT GTTTTTCTTT AAATGTAGGC TTCCAATCAT 
330851 TGGTCATTGT CTCAGCTGTG ATCTGAGGAA CGTAGAATTC AGCATAGTGT 
330901 GCTGGGAAGT GGCAATGTAC CCAAAATGTG TGTGGAAGGC TTGCCAGTAT 
330951 ATTAGCAATA CCTTCCCTCT ACCCTTTGAG TTCTCCTTAG AGGTTTAACT 
331001 CACTTTCTAA GCCTCCAGAA ACTGTATCCT ATGTTCCTTT ATATTGTATT 
331051 AGAAATAAAG AATTGGTTAA ACTCAAAATA AATAGATGTT AGTGCATTAA 
331101 CACAAGCCTG TACCCATTAT AAAAATGTAT TTGGCAAAGA AGGTAATGTT 
331151 TAACCCAAAG AAATATCCTT TAGGGAAGGG ATTTTAGGCA CCTTAATGTA 
331201 AAATACACAT TTGGTAAAGA GGCCCTCTTC AGTACACTTG AGTCACACCA 
331251 TATATTCAAT TTCTGACTTC ATAAGAGGCT AGGTCCTGAT AAAAATATTG 
331301 AAGGAACACA TGTGTTAAAA GTGTAGTATA ACATCTACTA TTATTGGATG 
331351 CTACATAAAA AGCATGGAGC TAAGCAATTC TCCATGTATT ATGATCATGC 
331401 AATTTCAACA ACCACTCTAT AAGATATATG CTATTATCTC CATTTTACAG 
331451 ATGAGGAAAC TGAGGCTTAG AAGTATTGTT CGAGATCGAG TGAAAGAGGC 
331501 CTAGTTTTCA GATCAAGAAA TTTGATTCCA GTTTTGATTT TATCACTTAC 
331551 TGTGATTTTC GCACAAGTCA TGTTACTATG TTTAACCTTT TGTTTCTCAC 
331601 CTGTAAAATG AAGGAAGATG ACTAAACCAT TTCTTGGGGC TGTTCCAGTT 
331651 GTATGATTTT ATGAACATCT CCATGAAAGT ATCTTTATTT AGCAAATAGC 
331701 CAAATCTGAG AAAAAGGCAA ATTTATTAGT CTTTCTAGAA CTCTTTAGCA 
331751 TCCTACCTTT ACCATGAAGC ATTACTCCAC ATTGGCTGGC CTGTATTCTT 
331801 TCCAACTCCT AAACAGTAGC TCTTCTTTCA AATGTCTCCC TCTATTTAGG 
331851 CACAAGCAAG GTGCTGAAAA GCATGAGAAA TATGAGCATA ATACAAGGAA 
331901 ATCGATAAGG TTAGGAATCT TCACACGTTT AACTTAATCA TATCAGACAC 
331951 TGAAAACAAA GTCATATTTT ATTGCCAAAG AGAAATGCTC TTTGGCAAAC 
332001 CATGATAGTA ATCTGATTGT GAACAATGCT TATTCATCAT TTCTGGGGCA 
332051 CTGGCTGCTG TGCCAGCCAA GGGGACAGCT GACCTAATGG GCTTGAAGTT 
332101 GCATTCCTCA ACATTAGTTA TGTTGTTGTG TGCAATCACC AACACAAAGA 
332151 TAGTGTACAT TGGTCAATCA GAAAATATGG GCTCCTTGCT TGTTTCCCGT 
332201 TCTGTTTTTA TTGACGATTG GGGGATAAAT TGCACAGTGA TAGCCAAGAG 
332251 CATAAATTTT CTTTGTACTT TTATATAATC TTGTTTCTCA TTTTACAATA 
332301 TCTGTATTTA TTCCTAAAGT TGAGAAGAGG ATCAGACATT ACAAAACAAT 
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332351 CTGGAAAAGG TATGGTAATA ATCTAAAATA TCCCATAGAA AACACCTTTA 
332401 TGTGTTGCCC TAGGTCACTG AGGCAGTTAT AACAAAAAAT ACTATAGACT 
332451 GGATGGTTTA AAGAAAAAAA CAGTTATTTC TTGCAGTTCT GGAGACTGGG 
332501 AAGTCCAAGC TCAAGATATG GCAGATTTAG TGCCTGGTGA GAACCTGCTT 
332551 CCTGGTTCAT AGAAGTCTGT CTTCTTACTG TGTCCTCGCA TGATGGAAGG 
332601 AATGAGAGAG CCCTCTTGGG TCTCTTTTAT AAGAGCACTA ATCCCATTTA 
332651 TGAAGTCTCT ACCCTCATTA CCTAATCACC TCCCCAAAGC CCCCATCTCC 
332701 TGGTACCATC ACTTTGGTAA GGGTTTCAAC ATATAAATTA GGGTGGGGGA 
332751 TACAAATATT TAGTTAATTA CATGTGTATT TATGTCTCAG AGATGAGGAG 
332801 GTCTTAGTTT CAAATAGTTA CACCTATGCA ATTATCATTC CAAACTTAAC 
332851 TTTTCCTGTA ATCCTCTCCT TTTGATGTCA TACCATTTGT ACACTGTGTA 
332901 AACAGTTACT CTGGGACATA GTCCATAATT TACACTCCCT TTTCCTTAAT 
'332951 TTTACTTTGA GGTTCCTTCT CAGGGCTTCC TTTGGACTCT GTGGTTACCT 
333001 CTATCATAAC TCTTATCACA TAGAATGAGA TATAATTATG GTTGTGAGAT 
333051 CAATCTTTAG ACGAGAGTTT ATTGATTGTT TGTGACTAGC TTTGTGTTAA 
333101 ATGTGGGGTC ACTAAATCCT GCATACACTT CTTTTCAACC AGGTATTGTT 
333151 TCAATAATTC CCTCTATCTC TCTATAAAAA ATTGAATTCC TTAAACCCGG 
333201 GAGTCCTTAT GGATCAGAGC AAGTGAGCAG CACATCGCAT TCTACCAATT 
333251 TAGCTTTTTT ACAGAATCCC AGTCCAAATG GGACTATTCC GTAATGTTCT 
333301 GTGGTCGTAA CTGTTGTCCA GTGTGATTGG CCTGGTTTTC TTAGGCATGA 
333351 AACATGATGT ATTAATTATG ATCTGAACAG TGAGCCAGAA GTTACTGTAT 
333401 CTAGTTTGAC AGTATTAACA TTTATGGCAC ATAGTTAAAC AAGGACTTTT 
333451 GATTCTGAGG TAGGAGGAGT TGGGAAGATA TGAGCAGAAT TTTTATACAC 
333501 GGGGGATTCT GATTAGAGGA TCAACAATTA CAAAGTATAG TTATGCACCG 
333551 TTTAATGACA GAGATATGTT CTGAGAAATG TGTTGTTAGG CGATTTTGTC 
333601 ATTATGCAAA CATCACAGAG TATACATACA CAAACCTAGA TGGTATAGCC 
333651 CACTACACAC CTAGGTTATA TGGTACAGCC TATTGCTCCT AAGATACAAA 
333701 CCTGTACAGC ATGTTACTGT ACTGAATACT GTAGGCAATT GTAGCACAAT 
333751 GATAAGCACG TATATAAACA TATCTGAACA TAGAAAAGAA ATAGTCAAAA 
333801 CATTGTGTTA TAATCTTATG GGACTACCGT CATATATATG GTTTGATTTT 
333851 GTCCAAAACG TCTTTCTGAG ACACATGACT GTTGTCTCAC ACAATAAAGA 
333901 GATAGAACCT TCATAAATGG AGAGTGGAGT GTCAATGCAA AGTGGGATTA 
333951 AATCAGGAAC TTACTTGAAG ACGGAAAGAG GAGGAACACC TTCAGCACCA 
334001 TCTCTCCCAT CCTTATCCCA AAACACTGAT GACTTTACAG CCACCTCCCA 
334051 TACACACTGC AAGCACAGGG CACTGCAAGA AGCTGTGGTT ACAGAGGAGA 
334101 CACTGCCTAA GGGATCAGTC TGAGTGCCAC TGGTATCTTA TGGAAATACA 
334151 AGCCAGGGAC TATCATAGTC AATGGAAGAG GATTCGTGTT AATGTCCCAC 
334201 AGTCAGCCAG AAGTGCTTGT GACTCTATGC ATAGGCACAG TGCCTTTGAA 
334251 AATTGACTTT CAATCACTTT GTGAGAGTGA ACACTGTTAC TGCAGGGTCT 
334301 AGCTGCTCCA CCCATTGTTC TAGATGGGAG ATAAAACAGC CTAACTCTAG 
334351 CTGGATGCTC AGTCCAAACT CTAGGATGAG GGTCAAGGCA GGGACTGCAC 
334401 TATTCATTCA TGTTTGCTTT TCTGGGACTC CTATAACTGA TGCCTTGGAT 
334451 TTGTGGGATG GTTGGTTATG ACCAGTGGCT ACACCTGAGG AACTCACTGT 
334501 AGCTTGATGG TTCATATAGA TTTGAGATTA CCAAAATGTT ATTCAACATT 
334551 CATACTAGAT GGTCTTGAGT ATACTTCAAA TTCAGCATGT TCTCATCTGA 
334601 ACTCACATCT CAACATCACC ACACTGTCCC TTTCCTTTTT CTTATATTTT 
334651 CAGAGTATCA GTATAAATCT CATTGCACCA GGCAGAATCT GAGGAGAAAT 
334701 CCTTGCAGCT CCTTCTTCCT GATTCATCAT CAAAGCCTAC TGATTTTGCC 
334751 TCCAAACCAT TCACTTCTGT TCCTCTTTAC ATATGGACAA TGGGAGGAAC 
334801 TGACTAACCA CTTGGGCTTG TCATTCTGTT TGACCCCTTT CCCCAGATGT 
334851 ATTCTTTGTC CTCTGCCCTA TTCTGCCTTG ACCTTGACTT ATGCCCTCCA 
334901 AGTGGGTCTG GCTGAGAAAT AACTGGCAAG AGATGAGCAT TGGGAAGGGG 
334951 AGATAGGAAA ATATTTATCT CCATCTCTTG CTTTGTCAGC TCTTCTGGTT 
335001 GCTACTGCCC CTACCTGCdA GGCTTTGGTA AACTGCCCCT CTCCTTGTCC 
335051 CTTCAAGCTA TGATGTGGTG CGCCAATGGC TGAGTCTTCC TTTTTGACTC 
335101 CTGAATTTTG CTCATATCTG TGTTAGTGAC TCCTTTGCTA CATTTTCTTT 
335151 AGGCCAACCA CTTTTTTTCT TGAAGAACGC TGGACAACCC ACTATTTTTC 
335201 CCATACTCAT GTTGCCCACT TCAAAACTGC TCTTCATCCT CTAGAGAGTG 
335251 GTCTTTTAAA GGGTAAATTT GACCATCACT TTTCTGATTA AAACTCTCCA 
335301 AGGGGATCTT GTTGATCTTA ATATTGACAG AAATAGCTCT AATGTGGCCC 
335351 ACAGCCTGTA TGAGCTGACT CCTGTCTGCT TCTCCATTGT CATCTCTTGT 
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335401 CACAGTCACT GGGTTCCAGC CATGTTGCTC TTTTATTGTT CCTTGAAATT 
335451 GCCATGGGAC CTCCAGTTTT GAAGGTCTTC TTTAATTCTT CTGTTTTCAG 
335501 AAATTCTCTG CTCCTTTCTT GCCTCCTCCC ATCCCTAGCC TGCAATTTTA 
335551 TAGGAAGTGT TACATTCCTT TTTAAAAATT TTTATTTTAA ATTCAGTGGT 
335601 AGATGTGCAG GATGTGCAGG TTTGTTCCAT AGGTAAATGT GTGTCAGGGG 
335651 GGTTTGTTGT ACAGATTATT TCATCACACA GGTATTAAGC CTAGTGTCCA 
335701 TTAGTTGTTT TTCCTGATCC TCTCCCTCCT CCCTCCCTCT GTCCTCTGAT 
335751 AGGCCCTGTT GTGTGTTGTT CTCCTCTATG TGTCCATGTG TTCTCATCAT 
335801 TTAGCTCCCA CTTACAAGTG AGAACATGTG GTATTTGGTT TTCTGTTCCT 
335851 TTAATTTGCT AAGGATGATG GCCTCCAGCT CCATCTATGT CCCTGCAAAG 
335901 AACATGATCT <?ATTCCTTTT TTTATGGCTG CACAGTAGGA AATG^TACAT 
335951 TCTTAGAGAA AATCTTCCTC TTCATGACAC TTTATGACAT TTCTGCCACA 
336001 GGAGTTTGAT TCTATAGTTA ATATGTTCTC ATAACACCTT ATATCTTTGC 
336051 TCCAAAGCAT TTATGATGGA GTATAATTAC ATAGAATTGT TTCATTGATA 
336101 TAAAATAAGT TCCAGTGTGT GTGGAAGGAT GTTCTGTGCC TTGTTTTTCA 
336151 TTTTATCTGT AACAATTTGC ACAACTTGTG GCAAATGATT ATCATTTTAT 
336201 AATAGTTATT GGGTAAATAA TTAAAAAGTA TCTCTAGCTA AGGCCAACCA 
336251 GGACAGCAAA AATAAAACTA AAATGAAGAG CACTTGAATG AGACAGCAAC 
336301 AGCAGTGAAA GTAGCCTTCC ATGATTATTA TCGAAATATC TGTTGGGAGT 
336351 TAAGCTGAAA ACTGACTTGA GGAGTTTGGG AATAAACATA GCCCACAGTC 
336401 AGAGGAAGGG GCTCATTCAA AAACATTTAG TTGCTCTCAG GTCAACGATA 
336451 GGTCTCAAAT TCCATTTTTC ACGAGCATCC AC AATAAAAC CTCAGAGCAC 
336501 AAGCCAATAA CTCAAGTACT CAAATGCCCT CCAGGGTTCC TTCCAAGTTG 
336551 CTCTGCTJGAC CTGAGCTCAT TACTTTTATA TAATGATTGC AGCAGAAGCA 
336601 TCTATGGAAG TACTGTCTAC AATGAGATAG TCAAAAAAAT ACTAAGGAGA 
336651 TAATTACAGT ATTAATTCTT AAGTTTTGCA GAATTAATTA AACATAAAAA 
336701 ATATTATTTT GGAACATCAA GTTTGGATAT ATGGTGACAA AACAAAACCA 
336751 CTGCTGGCCA GTGAGAACTT CTAGGGAATC TATATTATAC ATTTTTACAT 
336801 TATGTGAATT TGCATAAGTA AATTATCCAG AATAATTTAC TCATGTCAAT 
336851 TAACAGAAAA AAAAACATGA CTTAATTAAA ATCTCATTAC AGGAACTCTC 
336901 TTCAATCAAT AATTTTGGAA AGTGACAGAC AAAGATGGTC CACAGGGGGA 
336951 TACATGAGGA CAAACATAGC TTTAGATAAG ACTGTGTTTG CACCCTAGAC 
337001 TGGAATGTCT CTTCCTCAAA CATGTATTAA GCCATTCCAT TTACTGAGCA 
337051 CCTCTTATAT TCTAAGAACT GTACTTGTCA TCGTGGTGGG GGAAGGGAGG 
337101 CATAAATAAG CTGTGATACT TGGGACAGGA CAACCAAAAC CTAGTTTCCT 
337151 TCAAATAGGA CTTGAGTCAC TCATAGAAGT CTTTCCTAAA TATTCCGGTC 
337201 TTCAAGAATT TCTCTGAAGA CTTCTAAAAA GTACTGCTTA TGTCACGTGT 
337251 GAATCCTTTC TTGATGATGA TGATGATAGA GAAGTATTTA TTATGAAAGA 
337301 AAGAAAGAGA AGGAGAGAGA GAAAGAATTG ACATGAGCCA ATTCCCAAAA 
337351 TAAATCTTTT ATATATATAT GTATATATCT CTCCCTATAT ATATCTATGT 
337401 ATACCATATA TATATATTTA TATGTTACAG ATAAAATGAA TACATATATA 
337451 TGTATTCATA TATATATATT TATATGTTAC AGATAAAATG AATACATATA 
337501 TATATATACC TATACACACA TATCTCCTAN NNWTTCTGTT TCTCTTGAGA 
337551 ACCCTGATTG ATACATAGAT AGAAGAGTGT ATGTAAAACG GAAGCAAATA 
337601 GATCTTCATG GAAGATCAGT TGGGTATGAA AGCAAAGAGC TATGAAAAGT 
337651 ATATGATGCT TATTTGCTTA TTGTTACTTG TACAGATTTT GATATGTATT 
337701 ATTGTAATGA TTACTTTTAT GTCAACATGA CTGAACTAAG GGATGCTCAT 
337751 ATATCTTGTG AAACACTATT TCTGGGTGTC TGTGAGGATG TTTCTGGAAG 
337801 AGACTAGCAT ATGAATGGGT AGACAGCATC AAGAAGATTG CCCTCCCCAT 
337851 TGTGGGTAGG CGTCATCTAA TCCACTGAGG GGCGAAATAG AACAAAGAGG 
337901 CTGAGGAAGG GAGAATTTGC TTTCCGTTTG AAACATCCAT CTTCTCCTGC 
337951 CTGTAGACGT GGGTGCTCCC AGTTCTCAGG TTTTTAGACT CGGATCAAAA 
338001 TATATGCCAT CATTTCTCTC AATTCTCAGG CCTTCAGACT CAGACTGAGT 
338051 TACACCACCA GCTTTCCTAG TTCTCCAGCT TGCAGAGAGC AGACTGTGGG 
338101 ACTCCTTGGC CTCCCATAAC TGCACGAGCC AATTCCCATA ATAAATCTCC 
338151 ATATATATAT ATATATATAT ATATANNNNN NN3SNNNACAC ACACACACAC 
338201 ACACACACAC AGATACACAC CATATATATC TCTCTATATA TATGTATACC 
338251 ATATATATAT GTATATATAC ACACATATTC TCTTGAGAAC CCAGACTGAT 
338301 ATATAGATAG CAGAGTGTAT GAAACGTATG CACAGTCTAT AAATTATATA 
338351 GTAATCACCT ATCTACCCAC TTACCTACCC ACTACCCAGG TAAAGAATTA 
338401 GAATATGGCC AGTGTCTTAG AAACTTTCTG TATGTCTGTC CTCGATTACA 
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338451 TTCTGTTTTA TCCCACCCTC CAGAAGTACT CACCATCCTG TTTTTAGGTT 
338501 AATTACTTCA TTGCTTATAC TTATGATATT ATCAACAGTG TAAATGTTTT 
338551 CAAATAATAT GTGGATTAGT TTTGAAATTT GAATAGCTGT GTAAGAAAAG 
338601 AGCAATATTG CCACTATTAC ACAATGCCTC CCAAGAGAAG AGAAATAGAA 
338651 GTCTGAGAAA TCTCATGGTC CCAACTTCAT GGAAAGTTTA CATGGATGAG 
338701 TCTCCTCTAG GCATGTCAGA GGCCCCACTC AGACC ACGTG GTGGGGTTAA 
338751 GGTGTGTACC ATGTACTGGA TATCCAGCAG GGATATGATC AGGGGTATTC 
338801 TCTGCTTCTG GACTCATCGT TCCCTACCTA CAGCTCTCAG AGTGGGTCAA 
338851 AGCTGAACAT TTCTGTGAAG CAAATTGAGT GTCTTCAGCT GATTCAGGAA 
338901 GCATGGCAGA GTGGACAGGA TATTTATTAG CACATTGACA AGACAAAGTC 
338951 AGACCAGCTA CTATTGAGAA TCATGCTTAC CAAGGAATTT TATTCTCTGG 
339001 TAAAAAGTTG GTATGAGGTG TATGTATCAT GGCATTGGGC ATCTTAGCGT 
339051 GTATAACTTT CAGTGCATCT GACTCTTGGG GTGCATTGGT CCAAACTCTG 
339101 GGTTGCCCTT GAATGGGGGG AACTCTGTCC TTCTGATATA ACATTTTATG 
339151 GTGCTCCATA GATTCTCACA TCCTTTCTCC TATGTTGAAT GACCTGCTTC 
339201 CTATGTCTTG TTCTTTGTTG GCTTATTATT AAGTATTATT ATTATTACTA 
339251 TTATCGTAGA GCTCATATTT GAGTCACTTC TTAAGAAAGG AAAAGGAGTG 
339301 AACAATGAGA ACACATGGAC ACAGGGAGGG GAACAACACA CACTGGGGCC 
339351 TGTTGGGTGG TGGGGTAGGC AGAGAGAGAG CATTAGGAAA AATAGCTAGT 
339401 GCCTGGTGGG CTTAATACCT AGGTGATGGG TTAATAGGTG CAGCAAACCA 
339451 CTATGGCACA CGTTTACCTG TGCAACAAAC CTGCACATCC TGCCCATGTA 
339501 CTCCTGAACT AAAAATAAAA ATTAAAAAAA AAGAAATGAT AAGAGGGAAG 
339551 TGAATTTTTG AATCTTGAAT GTTTGAAAAT ATCTTTATTG AAATATCATA 
339601 TTAATTGCTA GTTTCCCCGG GTTTAGACTT TTAGATCAGA AATAATTTTC 
339651 CTTCAAAATT TCAAAGGCAT TTTTCAATGT TTCCTTTCAG GGTTATTGTT 
339701 GAGAAACCTG ATGCCATCCC AATCCTCAAT TCCTTATCCC CTCATTTGAA 
339751 GCTTATAGAA TTGTGATTTC ACTGTGCTGG AAGCTCACTA GGTCCTTTAC 
339801 ATCTAAAAAC TCTTGTCATT CACTTCTAGG AACTTCTTTT GAAATATTAC 
339851 AGTGGTAATT TTCTCCTTTC TCACTGCTGT GATATCTCTT TCTAGAACAT 
339901 CTGTTATTTG GATATGGGAT GTTCTACATA CTTTGTTCTA CTTTACAGTA 
339951 TATTCTCTTA ACTTTATCTT ACAACTCTTC AATTGATATT TTCATTTTAT 
340001 TTCCTGCTTA TTATTTGAAT TTCTAAGAAC TCATTCAAAA TGATGGGAAT 
340051 ATTTTTCTTA TGTGTTATTC TCTTTTTGTT TCATGTCTGC ATTACCCTAT 
340101 CCCTCTGAGA CTATGAATCA TAGCTTTATG TTCCTTTTTT TGGTCTTCTT 
340151 TTGGCATGGT CTCTATTTTC TCCTAGTTCC TCTTTGTTCT GTTTGTCCTT 
340201 TGGGTCTCTG TATTCTATTT TAGAGTGGCT TCTTTATCTT AGAGGCTTTC 
340251 CTTTAACTCT GATATCCAGT AAACCTAGGT ACCTGTTCAT GATTAAGAAT 
340301 GAACTACTCA GAGGTTTATT TGAAGCTCTC ATTGTCTGGT ATGTTGTCAA 
340351 CTTTCAGATT CGCAAGAGGA GAATGCTCCA GTGTCTTTGG TAATTTTTCC 
340401 TTGTGCTGCT CAGATTTCCC AGAAAACCCT CCCACTCTTC CTCCTGTAAC 
340451 AGCAGAAATC TGACTGCCAG AGCAACTGTC TTGGAGATTA AACAGTCAGC 
340501 ATTATTTTTA CATAGTTACT TATTTCCTTT GTTTTCAGTA TGGTGCCCAC 
340551 ACCCTCAACT GTGCTTGATG TCCCTAGAAA CCTTCTGTTT TGCTTTTTAC 
340601 TTTTTTTTTT TTAAGGGAAT ACGTATCAAC AATTTTATGG TGCAGCAAAC 
340651 ATCTTTTAGA GTCATGGAGT TGAAGAGAAT GGCAACTAGT GGTCTACCTG 
340701 ATTCTTAGAT GGCTTCACCC AAATATCTGT TACTTTGACC TCTTCCTTCC 
340751 CTTTATCTGC CTCTCCTGGT TCCAAACATG ACTGGGATGC CCCCTTCTGG 
340801 GGCCCCTTGA GGATTTGAGG ATTCTGTGGT GTAACTTGTG TTGGTTTTCA 
340851 ATTTTCTTTG ATACTGACTT GTAATTTAAC TTTCTCGGCT CTATCAAGTT 
340901 ACTTCTTATC CACTTGCTTT TGGCTTCTAA AATTTGGCTG CTGTTGTCTT 
340951 CTTTTCTTTC TTCCCATTTG CAAGAGTTTG TTTCTACAAA AAAAAAACAT 
341001 TGTTTATAAT GAATTTAGTG GGTTTCATGG GGAAACAAAA TTAATGTATG 
341051 TGTTTAACCT GTCATCTTAA CCAGTAACTC GGCTGGTGTA GTTTTAATGT 
341101 TAAGCTTTCA CCTATTTAAC ACATATTTCT' TGATCTCTGT GGAGTGAAGA 
341151 AGAACATCAT CTCGCCTCCA TAAATTCTAA GACAATTACT TTTGGCATCA 
341201 TTTAGATACC CTTTCCTTAT TAAGTTTAGG ATAGTCAAGC TTTCAAGCAC 
341251 TAAAAGTGCT GGGTAATAAT GGTAATTAGC CAAATTGGGA AAATAAGGCC 
341301 ATCTCTCTAC CTCGTAATTT CCTTTTTTCC CTCTAGGTTT AGCAGTACAA 
341351 TTTGAATTAA GACAAATTCA GATGTTCTCT AAACATATTA ATCCCATTGC 
341401 ACCCCCAAAT TATACACTTT TTCTTCTTCT AATGTTAAAA TAGTTACTAA 
341451 ACCAGCGAAT GTTCCTTAAC CACACTTATG CTACGAGCCC CTTATCCAAA 
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341501 CATTCTCTCA GTTTGGATGA TTTCACCTCC TGAATATCTC CTGCATTTTT 
-341551 TCCTTTTGTA TTTTCCCTGC TGCAAATCTC TCAAACCTAC CTCTTTGATT 
341601 GCTGTCAGAC AGTTTTTTTT TCTGAATTAC AAATATGACC ATGTCACTTT 
341651 CTTTTTTAAC TCCCAAATGA ATCTGTATTG CTTTAATAAA GTCTAAACTT 
341701 AGAGCAGCAT CAAAAGCTGT CAAAGATGCC CTAACTGCCG TCGTATTTTC 
341751 AACCTCATTT TTCACCATGG CTTTCCTGGG TGCAGCACTT CAGTAAGAAC 
341801 ACATGCAGGT GTGTGTATCC ATGGCAGACT CCAGGCAATT TCATACTGCT 
341851 GTGCCCTTAC ACATGCTGCT CTCTCTGCTT GGAGCATCCT CCTTGTGCTG 
341901 TCTCCTCTGG TGACATCCTT CAAAACCAGG TTGGATGTCT CTAGGAATCT 
341951 CTCACCAGCC CTCTTGAGGT TGGGCTCAGA GCCCCTTCTT TACCCATGGA 
342001 GAGTCCTCTG CATTGGTGCA TCCTGCATCG ACATGGTTTG CCTGAGCTTC 
342051 CATGCCCTCT TGGAGCCCCA TGATTTTATT CCTCATGTCT TGCATGCTGC 
342101 CTGACAGTGT AGAATGTTTC CTACTGTGAG AATATTGAAT CTGAGGGCTG 
342151 CCAACAGATG AATCTTTTCC CTTAGAGGAA TGTTTTACCC ATTTCCATAA 
342201 TTATTTCAGC CATTTAGATT TAAAATTCAC TCACTGGCCT AGGATTAAGG 
342251 TAACTTTAGT GTATTCCTTT TTTTTCTTTC CTTCTTTCTT TCTTTTTCTT 
342301 TTTCTTTTTT TTTTTTTTTT GAGACGTAGT CTCGCTCTGT CACCCCGGCT 
342351 GGAGTGCAGT GGTGCCATCT CAGCTCACTG CAACCTCTGT CTCCCGGGTT 
342401 CACACCATTC TCCTGCCTCA GCCTCCTGAG TAGCTGGGAC TACAGGCGCC 
342451 CGCTACCATG CCCAGCTAAT TTTTTGTATT TTTAGTAGAG ACAGGTTTTC 
342501 ACCGTGTTAG CCAGGATGGT CTCGATCTCC TGACCTTGTG ATCCACCTGT 
342551 CTCGGCCTCT TAAAGGGCTG GGATTACAGG TGTGAGCCAC CATGCTTGGC 
342601 CAACTTTGGT CTGTTCTTCA TGAGCTGTTT AATTTCTCTG GGCCTCAGCT 
342651 ATGTCATTTT AATCATTATT TGTTTAGATT CATATAATTT AACTCTAGCA 
342701 TTGTCATCTT TCTCAAAATC CTTCTGTCTT TTCCTGATAC TTGTCCCTCC 
342751 AGCCCCATAC ATCCTTTTCT GGTGTTTCCT AGATAAAGTT TGCTGCTAGC 
342801 AAGCTCTCAA TAAGTTCACC CAAGTTTTAG CTAATAATTT CAAATACACT 
342851 CCCAAATGGA TTTGCATTAC TAGAAAGAGT TCTGGATGCT TGAAAAGTCT 
342901 TATAACTCCA AAGATTAAAG CTTCCTATGG ATGGGAGATT TCATAGTGCT 
342951 CATTAAGGAT ATTGGGTGAA GGGGCTCTAC TCTGGATCTT CTCCTCCCAC 
343001 CACCTAAATA ACTGTACCAC ATGCCTTTAG GAGGGTGTGT GTGTGTGTTA 
343051 CAGAACATGG TCTCCAATTC TTTTGCTTTT ATTTCCAGCC AATGAGCTCA 
343101 GGGATGTGAG TTGGATAGAA ACAGTCGGCT AGACTACCGT GAGATCTGAA 
343151 GGATAGGTGT TTATTTTAGT ACGTTTATTC AGAGAATAGC AGGCACCTAG 
343201 GAAGGAAAAC CGATTACAGA GATAAAAAGC AGTTAGGATC TCTCGGAAGG 
343251 ATAGGAATTC ATGTGCATTC AGAAAGAAAA AAAAACAGTT GTTCTTGTGT 
343301 GTGTAAGGAA AATCATCTCT GTGGAAATAG GTTGTTTACG TTTTCAAGTC 
343351 GGCAAATTCC CTTCACAGCC ACACCACGTA TCAAATAGAG AAGGGAGACT 
343401 TTTTTTTTTG GTCCCTGGTG TGCACAAGGA GGATCCCTTG AGATGCATCC 
343451 TCGTGGAGCA GGAAAGATCA TAATAGCAAT AATGACAACA ATGCATCAAT 
343501 TGTTCTCAAA GGCATCTGTG GGGCAGAGTA TGCTTTTCAA AGGACATTCA 
343551 CTTCTATTCC ATCCCATTTG ATCATTATGA CAACCGCAGG AGGGAGATAA 
343601 GATGGGTTTC ATTATCACCA TTTCACAAGT AAGCATCCTG AGATGGGGAC 
343651 AATGTAGAGT TATTTTTCTG GAATCACAGA GCTAGCTAGT GGCAGCTTTG 
343701 GAGCCAGAGC CCAGTTTTTT CTAAGAATTT CTGTCACATA AATGGATTAT 
343751 GTGTACTGCA TGAACACAGA AATGCCCAGG ATGTGGAGAA AGCCCAAACC 
343801 TCTCTCTTAC TTGGAAGTCT GGATCAGTGG TTCTGGGAAT TCGCAGATTT 
343851 GCCTGGCAGT TCTCTTAAGA GGACACAGAA AGGCCAACTG GGAAATCCCT 
343901 TAACACTAGT TCATTGATAC CTTGACTATC TTAGCCAGGA GAATTGGTGT 
343951 TTTTATTTTC CCTGGCTAGC TGTTGGGTTT GTAAATCTCC AGAGATAAAT 
344001 TAAGGATTTG TTCAGTGGCC TTGAATGAAT CATGAATTGT TTCATAATTG 
344051 TTTCTTTATG CAAAGAAGAG CAGAAGGGAG TATTAGTGGC CTGGTTCTTT 
344101 GGACGGTGTG TCTACTGACC TGGTATCCAA GGGATGTGTT TGCTCTGCTT 
344151 GATGAACAGA GAAGAAAGAA GAGGAAAGGA AAGCAATGCT CAGCTCTTCC 
344201 ATGTAACACA GACTACTCTG CATTTGTTAT CCCATCAGCT CAGAATGTTA 
344251 GGTGGCAAGG GTTTTGGAGA TCTAGTGCAA ACCCCTCACT CAGAGAAGGG 
344301 AATTGAGACT CACAGAGCTG AAGTGTCCAC CTAAAGTCCA GGGAACCAGT 
344351 TAACAACAAA GCTGAGACTG GAATAAAAAA GCCAATAGGA CTAGTTTCTA 
344401 GCTACCATTT TCTTAGAATC TCCTTTGTGC TTGCTACAGT GATGCACACT 
344451 TGTGTATCTG TTGCTCTATT GTTTCTTACA TTCCCCAAAA TTGAGTCCTA 
344501 TTATTCCAGT GTTTTAGTGA GGGTAAGTTA GGCTAGTTTG AGGTAACAAA 
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344551 GTAGCACTGC AATCTCAGAA TCATTAATAT AACATAGATT TATTTATTGT 
344601 TCTTGCTACT AATTTAGCGT AGATTAGTGG GTGACAGATG CAGCATGTAT 
344651 GGAAGTGCTT CTTCTTGTCA CTGCCACCAA AGGACCCAGA CTGAGAGAAT 
344701 GTCCAACTTG CAGTAGGTGC ACCCGAGAGC ACACATCTCT GGGTTTGCTG 
344751 GGGCAAAAGA AGAGAATTCC CTTCTCTGTC GTTAGGCCCA TAAGAGCTTC 
344801 ACATCGCTTT TTCTCATAGC CCCTTGATCA GATTAGTCAC ACATACCCAT 
344851 CCAACTGCCA AAGAGCTGAG AAGTCACTGT CTTCCTGTCC TTCAAAGGAG 
344901 ATGAGGACAG ACACTGAGGA ACCTGAGAGA TGACAGCCAC. TTTGATCTTC 
344951 TTGCTTCTAC TCATAACCTT TAATTGCCAC ATAGCAGACA AAGTGATGTT 
345001 TTTAAAATAA AAATTGTATT TTATATATAT AAATATATTT ATAATTATGT 
345051 TATATTACAT GTTTTATTTA TAACTGTATA TTACATATAA ATTATATTTA 
345101 TAATTATACA CATAACTATT GATACAATAT ATAAAAAATA TATTATAAAC 
345151 ACATAATTAC ATATATACAA TTATATATAA CTTCTAGGAA AAGTAAATTC 
345201 ATATATATAC ATACACACAC ATATATATAT ACATAAATTC ATTATTATTT 
345251 CACAGCTCTG ATGGAAGTTC CTTCCGACTG TATTTCCTTC CTATCATCTT 
345301 AAGATAAAAG ACAAGCCACT GTGCCATAAG GCCTCTAGGC CTGTGTGGCC 
345351 TGGCCTTTTC TATTCTCATG CTTCAGTCCC CTAAACGTGT GCTGTAGCAG 
345401 GCATGCTTTC TTCCAGTTCT TGCATATTCC AAGTGCTTTC TTTGCTCAGG 
345451 TTCTTGACCC CTGCTTTCTT TTTGCTTGGC / TAATTCTTAT GCAGCTTTCA 
345501 CGTTTCAGCT TCAGTGTCAT CTCTGACCTG CTCTACGTTC CATGCACGTA 
345551 GAGAGCATGG GTATTTTGTT CTCCATTATA GATTCAGCAC ATCATACTGC 
345601 ACCTGGCACA AAGAATATAC TCAATAAATA TTTGTTTATA TTAATTAACA 
345651 AATTAATAAA TATTTGTTAA ATGATTACAC GCAAGAATGA ATGATCAGAG 
345701 GCCTCCCATA TACAGCACTA TATGTAGTCT TCTAAAGCAC ATTGACATAG 
345751 CCATAACAGA TAAATCAGTC ACTTAGAACA GCAACCATTT ATATTCTTTT 
345801 ATATTCTTAG CAATATTATT ATGATAAAGA TGCCCACATT GCAGGTAATG 
345851 ATAATTCCAA GTGTTCGCAA TTCATCACCT GCCTACTGGA CATTTTCAGC 
345901 ATACAACATA TTATTATTTT TAGTATTAAA GTACATAATA GGAATGGAAA 
345951 TCTAAGGTTT GAGTAACAGG AAATGTGCAA CATATTTTCA AAGACAGATA 
346001 CCACCAATTC AGCCTCTCTC ATGAGACCAA TGGAGTGATA TTTTCCAAAT 
346051 AAGGTATCAA ATGCAAAGTG ACAAAGGATT TTTGCATCAT TTCCAAATGT 
346101 GGACCAGTAC TTAAGAAGAG TAGTTTTAAC TCCCTACATT GACATAGCTA 
346151 AGGCAATTTG ATGGTTCGGA AGGACAATGG CATGGAATAT GTATAATTAG 
346201 AACTGTCATG GAGAATCTGA GACAGATTGT TGTCATCTAG TCAGATCTCA 
346251 ACAGAAGAAA AGTGATTGGA AGCCTGATAT ACCTATTTTA AAAGGTACAA 
346301 TGAGTGGTAT AATTAAAAAG GTTGGGTATA AGCTTGACGA AGTTAGCAGA 
346351 TACTCATTAA AACCATTTCA GTAATTCCTA TAATAATACT GGCCATGGTT 
346401 TTAAAGTCAC CTAGTGAAAG TCATTATATT GAAATAATTA AAGAAAAACT 
346451 TCATATTGTG TTTCCTTGGC AATAGAACAT TCAGATGGAG CCACACTAAA 
346501 TATTTTTCTA TTTATTGCAG CCATCTCCTT AATGACACTT TTCAGCACAT 
346551 ATTATACATT AAAATTGTTT ATGATATTTT GTTATTGCTG AATTTGATTT 
346601 AATTACATCA GGGCTTAAGG AAAAGGAGTC TAAACTGAGA GAGTTCATGA 
346651 AGACTTCGTT TTCTAACAGT TTAGCGAAAG GTCTTTCAAT TCCCTTGCCT 
346701 CTGCTTATAC GCCTCCTGCT CCCCACTCCT AAAACCATAC ATAAAATGGT 
346751 ATTGTCCAAT CTCTTCTTAG TCTTACCTTC ACTAAAGGCT GTGAACTGCC 
346801 TTCAGAAATA TACTCGTAAA ACTTACACAG ACTCTCATTA GGAAGCTGCT 
346851 ACCCTCCACT CCCGTCACAG CTTAACTCCA ATCTCTTTGG TCTTTTAAAT 
346901 ACCTTGTTTC CTCTCCATCT TTAAGAGACA AATAGATATA TCTTTCAAGA 
346951 ATGTTTCAAA TGAGAACTCA TAACTTTCTT GAGGGAATTC TCTTGCCCAC 
347001 TTGGAAAATA GCTTCTCTGG CAGGGGTCGT GATTTCCGAT GGGGGCGAAG 
347051 TCACGGCTTT TCGCAACTAC TCTGTATTTT TGTTTTAAAT GTACAGGAAT 
347101 CTCTTCTAAA GTTCAGATAT AGACTTATAC TCCCCTGACA TACTCGGGGG 
347151 GTCTGGATAT TTTCGGAGGT TGAAGCATTC ATATTCCATC CAGTGGTTTG 
347201 TATGAAAGTA GAGTCAGCGT GATCAACACT GGACACTGGG AGGTGACTCT 
347251 CGGACAGCCA AACCTCTCCT TTCTGTGCCA GCTTCCCCCT TAGGAAGACC 
347301 CTACTATACC ATTGTCAGGA ATTCCGGAAT TCCATACTCC CTTCCTCAAA 
347351 ACCCCCCTTT TTCCTGAAAT CCACTCTCAG GAAAACAGGT AAGCTCCTTT 
347401 CAATATACAA ATGAATCATC AGATCTTATA CTCTAGCTTT CCTGCAGATA 
347451 TTTGAGTAAT TGCAGAAGGC TAAGCTTTAA GTTAAAGAAG TTAATCTTTG 
347501 TCATTTATAA TAGAAAAAAA ATAAAAGCAA ATGACTAACA GGGAAGAAAT 
347551 TAAGTAAGTT ATTACTTATA CCTGATGGCA TGTTACCAAA GCACATTGAC 
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347 601 AGAAAGTTTA AATTTTTTTC CTATAAAATT ATTAAACATT TAAAACTTGT 
347 651 AGAATATTAA TGGCAAAACA TGCTTATGCC ATATGTTGAG CAAACACAGG 
347701 AGGAAAGAGT ATAATACGAT AAAAACTATT TAAAATGTCA CAGAAAAAGA 
347751 CTAGAAGTGA ATAGTTGCTT TTGAGTAATG GGATTATGGT TTTTATTTTA 
347801 ATTTAAAACA TCCCTTTATT ATTTTTTGTA ACAGGGAAAA ATGTAAGTGT 
347851 AAAATAAAGA ATGGATTCTC AATGGGAGAT TTTTTTTAAA AAACCAGTTT 
347901 CTGCTTGAAT ATATATACAT ACAGTTTTTT TTCTTTCTTT CTTTCTTTTT 
347951 TTTTTTTTTT TTTGAGATGG AGTCTCACTC TGTCTCCAAG GCTGGAGTGC 
348001 AGTGGCGCGA TCTTGGCTCA CTGCAACCTC CACTTCCCGG GTTCCAGTGA 
348051 TTCTTCTACC TTAACCTCCT GAGTA(?CTGG TATGACAGGC ATGTGTCACC 
348101 ATGCCTGGCT AATTTTTGTA TTTACAGTAG AGATGGGGTT TCACCATGTT 
348151 GGCCAGGCTG GTCTCGAACT CCTGACCTCA GTTGATCCAC AAGTCCCAGT 
348201 CTCCCAAAGT GCTGGGATTA CAGGCATGAG CCACCGTGCC TGGCCGAATA 
348251 TCTTTATCTA CTAACACCTA CACTCCTTAT TTGGTGGGGA ATGGCATACC 
348301 AGTTCTCCCC ACATGCTTTC CAGAAGGCTG ATTGAGTGGA ACACTTGGAG 
348351 CTGATGACTT GGATTTACAT AAGTCTCATC TTACTAACAC ATCAAGGCTT 
348401 TTGGATTTAA ACCTCTTCAG CATCCTCTGT ATCCCTAAGG AATGCAGTTT 
348451 CTGCCCCTCT TATCCCTGAG TATAGAGGAG AATAAAGCTT TGGCATCAAA 
348501 TCGCCTTCAG TTGTGGTTGA ATCCACCATT TGCTAGCCGT GTGACCTGGG 
348551 TGTCTACTTA GGCAAGGAAG ACAAGCTTTC TTATCTGTAA GGTGGTAATC 
348601 TGGTGAAATA AAAATGAGAA AATCTATGTT AAAAAATTCT TAGCACAGAT 
348651 TCTCAAACAC GAATAGTTAC TTAATATCTC ATGACTCATT ATTGCACTAT 
348701 GCCTCCAAAA CTGAATGTGC CTCATAGTGT GGGTTTTATC AAGATTGGGA 
348751 TCTGCCTCTC ATTTCTTAGG ATGAGATCTT TCTTAGGAAT ACCTGTGGCT 
348801 CTATTTCCTC CTTCCCCAAT CATGCCTGAA ACCTTGGATT GGCAAAGTCC 
348851 TGCTTCATTC AAATAAGCCC TGGGTTTGCT CAAGAGAACA GTGGCTTATC 
348901 TTTTCCAAGF TAATAGAATA CAATTTAAAT TACGTTTTCA CCACAAAGAC 
348951 TGAGTTAATC CTGCTAGAGG CTTTTTCCAT TTCCTGACCA ATACGATTTG 
349001 GCACTGGAGG AGACTATGGT ATGTTAATAA CAGTATTAAT TGTTAAAATT 
349051 CACTTGTTAC AAAGTTTTCA TTGTCTTGGT ATCATTATAA TTTTCCTTTT 
349101 ATAAGGAGAA TCGACTTAAA TCACTTGGCT GCTCTTTGTT CTTCCTGTTT 
349151 GCTAAGCACT GGCGACGTCT TGTTTTCAGC ACTTAACTGA* GTGCACATCT 
349201 CAGCTCTCAC CTAAAGCCTT TGCCAAGGGG CTAATTTCCA ACATTGACAG 
349251 CATCCTCCAC TCCTGCGTCC CTGAAGCAAA GCTAAAGAAA AAGTAGGAAA 
349301 TAGCAAGAGG GAAGAAGTAA TTTCCTTCAC TTCTTTCTGA GTCTGTTCAT 
349351 TCCTTCTTGG AATTCAGTTG AGGGTTAACT GTCTTCTCCT TTCCTGCGAA 
349401 TGCTTCACGC CGCCTCCTGG TGACCAAATA AAGCCATACT TTTTTTGCTG 
349451 TCTTTGGAAA TATTCTTCAT GGCTCCTGTG GGCTGTGGGA GCAGGAAGCC 
349501 AACAAAAGAT AAGGTCTCAA ATCTCTTCAA ACAGTCCTCA CACTTGCTCT 
349551 GCTGCCAAGT CATTAAGCAA GCAAACTCCT CTCCCATACG GGCTGTTAAC 
349601 TTTTATTTGT GTAGTGCTGC TTTTAAACAT GGCTGGCCTG AACTCTTTTT 
349651 TTTTTTTCTC CTTGTATTGA GAAGAAACTG CCCAGCATCT GCTTTGATCA 
349701 CTGCAATGTC TGCTAGTCTA CTAGACTTTT GGCCTATCTC AGATGACTCT 
349751 TTGAATGTAG CTTGTTTTTT TTGTTGTTGT TGGTGGTGGT GGTGGTTGTT 
349801 TTTGTTTGTT TGTTTAGCCT CATTCATGTA TTCATTCATT CAGGATTCAT 
349851 TTTTCTAATT CAACAAATTT ATTTTAATCA GCTAGTATTT GACCATTCTA 
349901 GATGTTAGGT ATATATCAGG GAATATAGCA GTTAAAAAAA AAAAAGAAAA 
349951 AATCCTTGCC CTTCTAAAGC TCACTGTCCA GTAGGGGAAG GGAGCTATGC 
350001 AAGTTACTGC ATATAGTGTG CCATAATGTG GTTAAGTGCT TTGGGGAAAA 
350051 AGTAGAACAG GGTAAAGAAG ATCAGAAGAA TCAATGGGAG AGGTCTATCA 
350101 TTTTAAATGA AGTGGTCAGG ATGGGTTCCA TTGAGAAGTG AATGTGAGTC 
350151 AAGACTCAAG GAACTGCAGG CATGAGCCAA AAAGCCCCAC GAGGCAAGAA 
350201 CGCTTCAGAT CCTGTGGGAG GAAACCAAGA CATAGAAGCT TTTGTTCATA 
350251 TTCTTGGCAG TATGAGTCAT TGGGGGAACA CAGACTAATA AAGAATGTTT 
350301 CTTGCCTTCA AAGAGTTTAT AGTCTAATAT AAGAAACAGG TCCAAAATAG 
350351 GCAATTTCAG TAGACTGTGA TTGATAAAAC ACTTGGCTAA GGATGCTTTA 
350401 GGGGCATAAA ATGGTCTGAA GAAGGTGATA TTGTTAATGG CTTCTCTTTG 
350451 TCAGCATTTG TTAAATGGCC TTTTGTAGAG TATAGACCAC TAATTAC AGA - 
350501 AAATAAGAGT CATTGGGGGG ACTCAAATTT GTTTAGCAAG GTCTATATAA 
350551 GTTAAATGAT ATAATAATAT ATTAAAATTT ATACATTAAG CGTAATAATT 
350601 AAGTGCTCAA TTTATTGATT ATATTTATAT AAGCTACTAC ATAGCAATAT 
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350651 ATTATTAATT 
350701 TTATTTTAAT 
350751 TATATTAAAT 
350801 CAGTGCAGTG 
350851 AAGAGATTCT 
350901 GCCACCATGC 
350951 CATGTTGGCC 
351001 CAGCCTTCCA 
351051 TCAATATATT 
351101 AATTAAATCA 
351151 TTTATATAAG 
351201 TTAACTACTT 
351251 TGACACTTTA 
3513 01 GGCTGGCTTC 
351351 ACCCCAGGT.C 
351401 AGATCAACAT 
351451 TGATGAGTAC 
351501 CTGTGTGAAC 
351551 TTGCAAAAAA 
351601 TTTTTCAGCC 
351651 TGCTACTGCT 
351701 TACCTGCAAG 
351751 ATGGAGCAAA 
351801 TGCTGCTGCC 
351851 AAATTCTACT 
351901 TCTATTTTAT 
351951 CCCACTCTAT 
352001 TTCCCCCCTG 
352051 TGAGTAACTT 
352101 ACCGTGCAGG 
352151 AGCTTCCAAT 
352201 GTTGAAAGCA 
352251 ATCTCATGAG 
352301 TGCCAAACCA 
352351 CCCACCTCCA 
352401 TAGATAGCCA 
352451 TCTCCTTACA 
352501 TTTGAATCTT 
352551 GTGGCTGGTG 
352601 TTGCCACTTC 
352651 GATGCTGAGA 
352701 ACCTCGTTTT 
352751 TAACAAAATA 
352801 TGATCTGTCT 
352851 AAGGAGAGAG 
352901 GTAAAAGCAT 
352951 GCACACAGCT 
353001 CCCCATCAAA 
353051 CTTTGAGATT 
353101 GCCCAATGGG 
353151 TGCTACTTCC 
353201 CCAGTCACGG 
353251 GAGGCATCGG 
353301 AACAAAAGCA 
353351 TAGCTTTTTT 
353401 CCTGGGCATC 
353451 TGTTCCTTAG 
353501 GCAGAAAAGC 
353551 AAATATTTGT 
353601 CATAGGTGAC 
353651 AGTATACCTT 
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TATATTTAGA TTGTCTCAAT 
GTATGGATTA TGTAAATGTA 
TTTTTTTTTT GAGGCAGAGT 
GCGAGATCTT GGCTCACTGC 
CCTGCCTCAG CCTCCCAAGT 
CCAGCTAATT TTTGTATATT 
AGGATGGTCT GGATCTCTTG 
AATTGCTGAG ATTACAGGTG 
AAAATATTTG TTAAGTAATA 
TTAAATAATT GATAATGAGA 
TTAATAAATT TTAAATATTA 
TTACAGTGGC CACTAGAAAA 
TTTCTATTTG TCAGCACTGG 
CTAGTTCCTT ACCTCTTAGC 
ACTGAGGGAC ACTGTTGAGG 
ACAATGTCTT TCCAACTGCC 
TTCTCATGCA GGGGATTTAG 
TGTTAGCAGG TACATGAATT 
ACAGTCTAAG TTTTACCCAC 
GCTTGTGGTG TTCATTTACC 
GATTATTATA ATAATTATTA 
AAAACTCAGT CTAGTTTTGG 
TGTTACTCAT GTAAAACTCT 
ATTTCATTAC CTGGGACATT 
TTAAAACAGA TTCCTGGTTT 
TCCTTGGAAG CAGTTCTGTG 
ACAGTTCTAT CATTCCATTC 
TATCAGTCCA TTTTTTCATT 
ATTAAAAAAA AAGAGTTGTA 
AAGCATGATG CTGGCATCTG 
CATGGCAGAA GGTGAAGGGG 
AGAGCAAGTG GGGAGGTACC 
AACTCACTCA CTATTGCTAG 
TTCACGAAAA AATCATGATC 
ACACTGGGGA TAACATTTCA 
AACTATATTA CTCATGCAGC 
GTGGGCCTGA AGCTCTGAAA 
CCCTTGACAG AACAGTCAAG 
ACATCCGTAT CTCCTAAGGC 
TGCTAGGTGT GCACACTTAA 
TTAGAAGTCC AAAAGTCACC 
CTTAATTACA AAGAAAAATT 
GAGAATGTAT GTGGGCCATA 
GTTGGATTTT ATTCTTATTC 
AAGTCCATTT ATTAGCTCTG 
GGATTTTAGA GTCAGATACA 
GTGTATCAGC CCCGCATCAT 
ACAGGCAGTG CCCCTGTGGT 
CTTCATAATT TTACTTTCGA 
ACGCTGGAGC AAGCCCTGGG 
ACCCCTCCAG GGAACCACTT 
CTGCTGCTGC TGCCCCTGCA 
GGTTGGCACC TGCTCCCAGG 
CACCATGCAG GTCAAGAGGG 
TTTTTCCTTT TTGCATAAAG 
ACAAATTTTG TAGCTGACCC 
CCATTTGATT TTCTGTGTAT 
ACCCTAAGAT ATGCCTGTTT 
GTGGATGTGT ACTTATGCAT 
ATAAGACTTC AGAGATTGTT 
TACCAGCACA GGGCAGAATG 
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AGGATTAATT CATAGATTGA 
GTTCATTATG TTTATGTCAA 
CTTGCTCTGT CACCACACTG 
AACCTCCGCC TTCTGGGTTC 
AGCTGGGATT ACAGGCGCCC 
TAGTAGAGAC GGGGTTTCAT 
ACCTCGTGAT CCGCCCACTT 
TGAGCCACTA TGCCAGGCTC 
TGAAATTAAT GTATTTTGTA 
TTTTATGTAT TTAATTACAT 
TAATTAATTT CACTTTCATT 
TTTGAAATTA CACATGTGGC 
TCTTGATTCA CTGAACAGAT 
ATTCCAGAAT GAACAAGGTT 
AGAAAAAGTT GGATTCATTG 
TTACCTTCTT CTTAGTAACC 
ATAAAACTTT CTTAGACTTT 
TATGTGATTC TTCATTCTTT 
TCTATAAAGT CTGTTCTCTA 
ATTCAACACA GGTTAAGCAC 
TTTTGCGATA GTTTTGGTTT 
ATTTCTTTAT ACCGCAACCC 
CAATGAGTGT CTTTTATCTT 
TGTGCCTTGA ATTGTGTATG 
ATATACTCAT ATTGATGGTC 
TTCTGTTTCC TCTGTCCCCA 
TTGAGAGTCA TAATTCAGCA 
GATATAAAAA TACCTGAGAC 
ATTGGCTCAC AGTTCTGCAG 
CTTCTGGGGA GGTCTCAGAA 
GAGCAGGTAC ATGACTTGGT 
ACACAGTTTT AAATGACCAG 
GACAGTACCA AGAGGGATGG 
CAATCAGCCT CTGACCAGGC 
ATGTGAGATT TGGGCAGGGA 
CAAACACCCT CGCTGGTCAA 
TGTTTTTCTG GTCTGAAGAT 
CCATACTACT CAATTTTAAA 
AAAGTGTTCT AGTTAATGAT 
TTGAGGCTCT GTATGAGTCT 
AGAAATTATT ACCTGGCATA 
AAATATCTTC TTTCCTTGGC 
GGCCTGGGAA ATTTAAGATG 
TGATCCCCAG ACTTCTCACT 
GAAGGCAACA TGATTCTGTG 
TTTTAAATCC CAGCTTCATT 
GTGTGTGCGG TAGCACAAGG 
TTAATGCCCT GCAGTTGCTG 
ATTTGTGGTT CATAAGTGAA 
TCGCTGGCCT TCTCACCTCC 
TGCTGCTCCT CCACTCCTGC 
GGGTCTGGAT GCAGGCATGG 
GAGTGTGACA TTAAATAGCA 
AGACCACAGA AGAAAAGAAA 
CAGCTGCATT TTGATTTTGT 
TAGCTGTACA ACTCTAGGAA 
CCATCAGAGT AATGGTTCAA 
GTATGTGTTT ATAAATATAT 
ATATGTGAGT CCAGTGCATA 
TAACTGCTTA CAACAATAGT 
GCTGTAAGAA CTTCAAGGTT 
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353701 GCTGAACCTG TCACTGTGGA AGAGTGTAGA AATGGACCAC TGGGCAGATA 
353751 TTCCCTCTTT AGGGGAGCAC AGCTACTGCC ACACTTTGAC CCAGCAGGGG 
353801 TGGGCTGGTG ATTGCATTCA CCGACTTTTC TCCTCTGATT CTCTCACCTC 
353851 ATGCTGGTGC CTTCTTCTGA TAGAAAAAGG ACCAGCAGCC CCACTGCAGG 
353901 GACCACAGTT GACGTGGTCC CTGGAGGCCA GCCTTCCAGG GATGGAGTGG 
353951 GGTGAGGAGG TTGGCTAGTA GATCTGTAGG GCAAATAAAG AATGCCCAGC 
354001 TCACCTTTAG CAGGGTTGCT TTGAAGATTA CATGAGATGA AATAAGTAAA 
354051 ACCCTTTATA ACTGGAAAGA CAGAGGCATC TATTATTACC GTTAAGATGA 
354101 TTCATCTCTA TGGAGAAACT GGAGCAAGTT TGAGTTCAGG TAGTGGGATT 
354151 GGAAGAGTAA AGAAAGTTCT GCGGCATTCC TGATTCCGTC ATAGCACCAT 
354201 GCTCAGAAAT GGTGATGTCA TGTTAAGGTA CAAAATAAAA TTCCAGTGAC 
354251 ACAGCCTCAA AATTAGTAAC ACAGAGATGG GGGTGTTGCG ACATCTGGGT 
354301 TCTCATTCTG GCTCTGACAC CTTTCTGCAG AACAATCACA CAAGTCACTT 
354351 GCCTCCTCTC CTCCCCTCCT TACTCGTCTT TGAGAAGGGC GGGCTAGTTA 
354401 CAGGCTGCTT GTGAGGATTG GAGAGATATC TAGAAACTGC TCTGATACAA 
354451 GCCACAGCTT TTATGAGCGA GTCTTTTCCT GGGCCAAGCA. TGGAAATCTA 
354501 GGTTGGTGGG TCTCTGAAGC AGTTGGTTGT CCCTAAACAG CGACTTCATT 
354551 TCCGGAGCTA CTTTTTAAGT GTTGGTGAAA ACACAAATAA CTGGAGACTG 
354601 TGAAGTCTTT TAGCGCCGGG ACCTGTGGGG CAGTAACCTC CAACAGGTAG 
354651 AGTGAAATGA CATCCTTTCT CTCCACGCCT GAGTGTTCCT ATGGCACCAG 
354701 TTGCTTGTAA TGGGTGAAGA AAAAATTGAA CTCGTTTCTC CTGAGACACA 
354751 CATATGAAAG CTGACTCTAA GAAAGAAAAA CTGAAATGTG GACCATCTTA 
354801 TTTAGTTCCA GTTAAAGAGA GATGTGTTTA AAGCACTCTG ATGTGTGGCC 
354851 TTAAAAAAGG ATGGGGCTGC CTCTTAAGGC AGAATCACAC GAGCACATAG 
354901 AAATAAAAAT CAAAGCAGTT ATCATATTTA ATTTTCTTAC TGACTAGATT 
354951 CATTTTGCAT TAGCAGGGGA TGGCGAAGGA GAGGGACTGG GAATGCATTA 
355001 TGTTCTTATT TGGGTCATTT CCTTGTGCTT TGGGGAACTT TCTCTTGTCA 
355051 GCGTGCACAT GCATGCCGGG GGTGGGCGTG AGCTGGCCCC ATGCACTCAC 
355101 TGGCTTCTTT CTCGGTGTCG TATGACCCAG GAGGGAGTCT GCAGCATGGG 
355151 GTGATTCTCC ACATAGAGAG TCATTCACCA CTCTCAACCC CGTCTCCAGG 
355201 TACTTCCTGT GTTCATTTGA GAAGAGGCAG CTGACTTAAA GCCTCATTTC 
355251 TAAGCAGACT ATTAGAAAAC ATGTCCGCAT AACCTGGAGC GGCAGCTCAC 
355301 GCCTATAGTC CCAGCATTTT GGGATGCCAA GGTGGGCAGA TCACGTGGTC 
355351 AGGAGTTCAA GACCAGC^TG GCCAACATTG GGAAACCCTG TCTCTACTAA 
355401 AAATACAAAA ATTATCTGGG CATAGTGGTG CACACCTGTA GTCCCAGCTA 
355451 CTCAGGAGGC AAGGGCAGGA GAATCACTTG AACCCAGGAG GCGGAGGTTG 
355501 CAGTGAGCTG AGATTGTGCC ACTGCACTCC AGCCTGGGCG ACAGAGTGAG 
355551 ACTGTGTGTC AACAAAAAAA AAAAAAAAAA AAAGAGAGAG AGAGAAATAA 
355601 ATAAAAAAGA AAACATGTGG GTACAAACAG GAAGAGAGAG GGCGAGGGGG 
355651 GCATCCTGGT TGGAGACAGC TGGACTCACA CATGCACTTT GTAGCCACTC 
355701 ACCATAAAGG CAGCCCCAAA GGCTCTGAGA GCTGGAGGGG GTGTAAGAGA 
355751 TCCTATCCTG GAAAGCTCCC ATTTCATACA CAGGCAGACA GCCTGTGAGG 
355801 TGAATGGGGT GACGTGGGGC TGGAAGAGAA GGGCAAGGGT CAAAGGAAAG 
355851 CATTTTACCT CCTTTAGAAA CCATCATGCC CCCAAACCTA AAACTTAGGG 
355901 ACCACCGCTG CCTTCATCCA TTTGTCACCT AATTCTCTAA TATTCCAGTC 
355951 TTAGTATCTT TTGAGCCCAT TTTATTCTTG CCCTGTCTAT TGCCACTGCC 
356001 TTAGCTTCTG GCCTCCCTCA TTTTTCATTT AAATGACTGC AAGTCTTCAC 
356051 CCTCAGCTTT CTAAAATGCA GGTCCGGAGA CTTTACTGTT CTTGCTATAT 
356101 TTCCTCAACA TTCCCACACC ATTCTCAAGA CAAATCCCAA GGGTGTGAGC 
356151 ATAGGACAAA GGGCCCTCTC CATGATTTAA CTGGCTTTTA AGACTCCAGT 
356201 TGCACCACAT TACATATTTG CCAGGCACAG TTCTAAGTTC TTTACAAATG 
356251 TTAGATCATT TACTCCTCAA ATAACCTTGG GTGATGTGTA TTTTTATTGT 
356301 CTCTACAGAG GAGAAAACTG AGACACAGAG AGGTTCAGTG ATTTTCCAAA 
356351 GGTCACACAG CCAGTAAGGA GCAGGGCTGG GACTTCGACC AGACATCTGC 
356401 TGGCAGAGTC GGTGCTTTTA GCCCTGGCAC CTGCTGGTGT GGAGTGCTCT 
356451 GTATGAATCA GACCCTTCCT CACCTCTGGC ACTGTACACA GGCTGTTTCC 
356501 TCACTTTGAA GCAACCTTTG CTCCTTTCTT TGCCTGAGCA TATTCTGTTT 
356551 GTTTTTCCAG ATTCACCTCA AGTACCACAC AGTGTCTTTC ATAAAACAAG 
356601 ACCCTGTCTT ACTTGGTTTC TGTGACACAT TATACCAGAT TGAAATTACT 
356651 GATTAATTTG TCTGTGTTTC CCCAGACAAC CTGTGTGTCT CTCAAGGTTA 
356701 GGGACAGTTT CTACCTTTCT CTCACTTGCA ACTGAATCAT GGGGGCACGG 
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356751 AATGAAGGAT GGTGCAAATG GTCTTTCAAT TCTTGTCTCT GACTGTTCAG 
356801 ATCAGACGAG TTACTTATCT CTGAGATGCT TTTTATAATA GAGGTAAACA 
356851 GAGTTCTTCC TACAGCACAG CTATTGTGAA AAGCAAGTTA AATGATGACT 
356901 ATAAACCCAT AGCCCTGGTC TGGCACATAA TAGATACACC ACAAAGCCCC 
356951 CTCTGTTGAG TCAGTCTGGT GTCCTTGCAA TCCATCCTCC ACTACACTCA 
357001 TTTTTTTCTA AGCTCATTTC TCCATCTGGA CTCATGCTTG CTATCAAGGA 
357051 TGATCTCCCC ACTTCTACCA TCACAATTAT AAAAATTCAA ATCCACCTGT 
357101 CCACTAAGCC TTCCTTAACT GTGTAGCTCG CTTTGGTCTC TCCCTGTTTC 
357151 CTAAAACATA TTCTCAGTGT CAACATTCTC TGGAATGGTC CCCATCATGC 
357201 CCGCTGCCCC CCTGCCTGAA AATTTTGACT GTGGATTTAT GTCTCTTATA 
357251 TTCTCAATGA GGTTGTATCC TCTCAAGTGA TCAAGGCTAT AATTTTTATT 
357301 TCTTTACCAT ATTTACAACT CTAGGAGAAT GACACGTGTT CTCCAAATAG 
357351 TAAATACTTG CAGAGCTAAA GTGAATTGAA GTGAATTCCT TTCCCAGGTA 
357401 CTTTGCTGCC TCCTAGTGTT GTGGTGAGGC TCCAGGCTGA GAGAAGTTTC 
357451 CTCTGAGCTC CAGCTCAGCC AGCACCCTTC TGAGTGATGA ACACTGTTGC 
357501 AGACCATAGA CCTGAGACAG CTGGCTGCGG TTTGACACAG TTTAGAAGAT 
357551 GCTACTACAT CAGAGGCCTC TCTTGTCCAT TGGGTCCCTT GTAGTGGACC 
357601 CCTTTGCTTT ACTCCTGAAC TGAGTATCAG TAGGGCATCT GTGCAGGGCT 
357651 TTTAAAGACT TCTTTCCTCT CTCAAATACC TCCCTTCTTC CCAGTTTCTG 
357701 ACCTGGTTAC TGTTTGGAGT TTTTAATTCC CTCTGATCTT AAGGTTATAT 
357751 TGCAGTTCAT GTTAACATTC AAATACTATT GAGAAAAATA AAACATAAAT 
357801 TTTCCAGTTA TCTAGATTGT AAATGAATTT ATGATCCTGA CGAGAAAAAG 
357851 GCATGAAAGG TACCCACTCT ATGAAAAAGT AAAATGGGGC CACGTTAATG 
357901 AGACAGAAAG CACAAGAAGC AGAAAGCCTT GTGTATGTCA TAGATTAACC 
357951 AGAAGTTGTT TCGGCTAGGT ACCGGCAGAG ACAACATGGT ACAGTGGAAG 
358001 GAACATGGAT TTTGGTGTAA ACACTGAATT CAAATCCCAG CTTTTTCAGC 
358051 TGTTAGCTCT GTGGTGTGTT AAGTCTCTCA GATCCTCAGT TTTGTTGCCA 
358101 GTGCAGTAGG GCAGCGATGC GTAGCTCAGA GTGTGTCATG AGGATTGGAT 
358151 GAGACAATAT ATGCCAAGGC AATTAATACG ATGCTAAGGG GTTTGTGTTT 
358201 TTGAAATGCA AAGAGTTTAG TTTTGTATTG ATTCCCATTA CAAGTTTGCT 
358251 CCTGTATTTT CCGGCTGTCT AGTCCATTCA CACATCTTCA TCTCTTAGGG 
358301 AAAAGACATG GTGTCTTTAA TGTTCTCTGA ATTAGCACAG TGCCTGGCAC 
358351 TTAGGGTTGT AGTAACAATT ATGGAATAAA CGTAATGCCT ACAGCTAAAG 
358401 ATGCTGAAAC TCTGTGACTT TCCATTATAC ATTCTTTTTT CTTTCTCTTT 
358451 TTCTTTTTTT TTTAGACCAA GTCTTACTCT TATTGCCCAG GTTGCAGTGC 
358501 AGTGGCCCCA ACTTGGATCA CGGCAACCTC CGTCTCCCAG GTTCAAGCAA 
358551 TTCTCATGCC TCAGCCTCCC GAGTAGCTGG GATTACAAGT GTCCACCAAC 
358601 ACACCTGGCT AATTTTTTTA TTTTTAGTAG AGATGGGGTT TCACCATGTT 
358651 GGCCAGGATG GTCTCGAACT CCTGACCTCA GGTGATCCAC CCACCTTCGC 
358701 CTCCCAAAGT GCTGGAATTA CAGGCGTGAC AGCTGCACCT TGCCCATTAT 
358751 ACATTCTTTC TATTCTCAGT TTTTATCTTT AATATTTTAG AGTCGGAGAT 
358801 AATGACTCCC ACATCAGCAC CCCAATTAAA TAGACTTTTT CCTGTTTATC 
358851 ACCTTTCCTT TAATTTCCTC ATTCAAAGCA TTATTCAGTT GTCCTCTCCT 
358901 CTTGCTGCTT AAGCCTCTCA CCAGCTCTCA GTCAATGCTC CCAGATGCAG 
358951 CTTTCATTGC ATGTGATGTT GTTTAATTTC ACTTGTCTTA TCTATAATCA 
359001 CATCAAGCCA TTTATTTGTT AATAGGCTGT TTGGTGGCTT CACAGTTGGA 
359051 ATTGCTGTAC ATCACATTTC ATTTCCACAA GGACATCCTT TATGCCGGTC 
359101 TGCCTTATGT GGCATTCGTC TCTTCCTCTT GCCAGGCTGG GCCAGCTTCT 
359151 CCCGCTGTAC CTTCCACATC AGCTGTTCAT TTTCCCCCTG CTGCTGTCAC 
^^01 TGATAATTTC CAGGTTAAAT CTTATAAGTG GAAATTGGAT CCACAGCTGG 
359251 AGTTGAAACG CAGCTCCTTG GAGTTATTAC CACAGTCTGT TCCCGGGTAG 
359301 GTGGGAAAGT TGAAGATGAT AGATTTCCAG TGCCTGGGAA TAACAATGCT 
359351 GTCCCTCTAA CCACAACTAA GGTGAGATTA CTTTCCCCTG GGGAGAGCGA 
355401 TATTTTTACC TTTTCACTCT GAGTTTACAT • CAGGGGGATT CAAGAGAGGC 
359451 GAGTGGTTTT TCAGTAGTTT TCTGTTATTT TATGAGAGAT GAGTTAAAGT 
359501 GATAATGTAG GCTGGACAGC AAATAAGTTA TAAGTGATTG ACCAGCCTTT 
359551 TAGGCTGTCA GCTATCTCAA ATAGAGGCTT TCTTTATTTT TTTTTCCACA 
359601 GTAGAAACAT TTGATTTATT TGATTCAAAC AGTAGAAGTA TGAGAGGATC 
359651 AAATAAATAG GTTTGTTCAA GAAAAAAAAA TATTTTAGTA TTTCATGCTT 
359701 TCAAAGCCTA TGCCATAACC TTTGAATTAT TCAGCAATAA TTAATTCAAC 
359751 AACTATTTAC TGAGCAGCTT CCACTGTiTC AGGATTGGAA ATATCAGCGC 
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359801 AATAGATCAC ACAAGATTCC 
359851 AGACTGATAA TAACAAGACA 
359901 GAAGAAAAGA AAACAGGAAA 
359951 GAACAACTTT TCATAGGAGA 
360001 GGAGATTTGA GTAATTAAAA 
360051 TTTTCTCTCC ATCCCATGCT 
360101 ACCATCCTGA GGTCAGAAAT 
360151 AGATGTCAAC AACATCCACT 
360201 CTCAGGTGCA GGGGATGGCA 
360251 GGTTCTGGTC TGGCCTCATC 
360301 GCCCCCTTTC TGGGCCTTAG 
360351 TCAGATGATG TATGAAATTT 
360401 ACCTTAAACA GTCCGCGTGG 
360451 TTGTGAGGGA TCAAAATGAG 
360501 TTGTCACACT GAATCTGACC 
360551 AGAGAACACA CTATCTAATA 
360601 TCCCTTATTG CCCATTCCGG 
360651 GACTCTTCAA AGGTTTTCAG 
360701 CCCCTTGGAT ACAACAGCAT 
360751 TTTGCTACCA CCCTCAACTC 
360801 AGATATACAG GTTAATAAAG 
360851 GTGTCCACGG GCTGCTGGTC 
360901 CTGGGCATCG AGGTTGGGAG 
360951 TAAGCCATGT TATCCTATTT 
361001 GCTCTCAGCC AACCCCTACC 
361051 CTGTCTTGTA CACACTCAGA 
361101 CCTGGACTGT GTGAAGTGGC 
361151 GTGGCAGGAC ATCAAAATTG 
361201 CCATATGCCA AGCACCATTT 
361251 CTGGCAAGAT CTCCACCTTC 
361301 GAAAAAATAG GCACATAAAG 
361351 TGAAGAATGT AATGTGGATA 
361401 TTACCCTGCA TGATTGGGAA 
361451 ATTCATTCAA CAGAGATAAT 
3 61501 TAGGCAGAAA GGATTCATTA 
361551 GAGCCTTCTT TCAGCAAGAA 
361601 TTTTTATAAT AGGTATAATA 
361651 TATAGGAACA GTAATCATTC 
361701 TATAAGTGTA GTTGTAACCC 
361751 TTTGGATAAA GCACGTCCCA 
361801 AACATCCTAT GTGATGGAAC 
361851 TAAGAGGTTG AAGAGAGAGC 
361901 TCCAAGTTTC AAGGGAAAAG 
361951 GAATTGGAGA GATCTGATTT 
362001 TGGAGCATCA ACTTAGGGAC 
362051 GCTGCTACAG AGGTCTAGAA 
362101 TAGCAGGAGA AACAGTGATA 
362151 ATATATATAT ATATATAGGC 
362201 AATAGATAGA TAGAAATAGG 
362251 ACTACTGAAT TGGTTGTAGG 
362301 CCCCTAGGTT TGGCCTGAAG 
362351 GAGAAGGAAG ACTTTGTGGG 
362401 AAATAAATAG CTCAGTGTGA 
362451 CAAATGAGGA TGTGAAGTAA 
362501 GGGCTGGTAT TTGGAAGGCA 
362551 GAGGATGAGA GCAACTAAAG 
362601 ACTGAGTTCT GGGACATTTT 
362651 CCCAGCAAAG GAGGGCAAGA 
362701 GAGTGTAGCA TAGTAAAATT 
362751 TAAGATGACA GAGAATTAAA 
362801 ATTACCTTTA AAGAGTGTGG 



ATCTTTTTGA AGCTGGGCTT CTAGTGGACA 
AGATACATAA GATGATTTCC AATGTACTAG 
ATGTGAGACC ATCTTGGGTG AGATCATCAG 
GTTGATGTTA GTGAATCCAT AATTGCTAAA 
GGGGTCCGGG TTGATCTATA TCTCTATTTC 
TTCCTCTCTG ACCCACCCCA GCCCCTAGTA 
ATCAAGATAC CTTAAGGGGC TTTCTCCTTA 
GAGGACTGAA TAGAAGATGC CAGCGCTTGG 
AGAGCTGAGC TCTGGGCTGT GAGGAGTTCT 
ACTGGGGCTG TGTCACCTTG GGCAGGTCTT 
CCTTCATTTT CATTTGTGAA GTCAGTGTGA 
CAGAATCGTA TAGAAGATAA CCTACCTATT 
GAGGAGAAGG TAGCAATAGT TGACATGAGG 
GCTCTCAGTG CTACCCTACA CCCCTCAACT 
TGGAAGGTAC ATGTAGGGAC AGTTTCTTAT 
AGTGGAGACA TGGCATGTCC TCAAAACATC 
GCTTTAACAT CTGCCCCTTC CAATTACTTT 
TGCACCTTAA ATGGTTTGTA CATCTTCTGC 
GCACGGACAT TTGCTTTTCT GATTAATCCC 
AAAGCATTAC AGGTAGGAAA ATGGACTCAA 
AGAAGAGGGG AAGAGCCAGG AGCACAAGCA 
CCAGCCAGTG AGTGAGCAAA CATAATGGAC 
AATGCTGAAA AAGAAATGCC • AAAGTC AATT 
CCTGGGATTG GTGCTGGGTA CTGAACAGAG 
TCACCCAGAG CACAGTATAG AGTATAGACA 
ATACAATAGT GATGTGTTTC TGTGTGCAGC 
TTCTCCTGGA CATGTCAACT CTCCTTGCTT 
GTTAGACTGA AAGTGTCTAA TGAGCTACTC 
TCAGCAACAG GGACTCAGCA GTGAACAAGA 
AAGGGTCTTA TATTGTAGTG GGGAAGACAG 
AAAAGCAAAT CAATACATTT CAGTTGCTGA 
AGAAATAACT GTATGGGTAT TTGTGTTGCT 
AGTCTCTCTG AGGGATAGAT ATTTGAGTTC 
TAAGTACCTT CTATGCTGCA GGCCCTTTTC 
ATGAATAAAC CAAACAGCCC CATACACAGG 
GAGACAAAGA AAAAATATTT AAAGTGTAAT 
AACTTCATAT TTAATGATGT ATAACTGATA 
TTATTGCTTT ATTGAGCAAT GCACATCATA 
AAATACTAAG AGGGTGGAGC CACTTAAAGA. 
AGAAGAACAT AGTATCCTCA AGGCACAAGT 
ACTTGGGCCA GGAAGGCATG GATGGGAAGA 
CAGGAGCCAG ATCATCAAGC CTTGATTTCA 
CCTGGGAAAG TTTTAAGGAG ATGGCAGAAG 
AAGGTTTAGA AGATTCTCTC TGGCAGTTGG 
AAGAAGGGCA GCAGGGAGAT GGGTGAGGGA 
GAGAGAGGGT GGTGACTTGA ACTAGGGCTG 
CATAGATGAT ACATATATAT ATATATATAT 
AAATAATAGA TGATAGATGA GTAAATAGAT 
TAGATAGATA GATGTAGACC TGACAGCGTA 
ATGGGAAGGA AAAGGAGAAA TTAAGGTTTT 
AGTTGGGTGA ATGTTGGGGC CATTTATAGA 
TAGTGAAGAG GGAATTTGGA GGACACAAGT 
GCCTTTTACA GTTTGACTTT ATACAGTATG 
ATAGATATGA GACTACAGTT TTGAGGAGTG 
TCAGTGGTGA AATGGCATTT AAGCTAGGGG 
AAAGTGTATA CATAGAGAAG AGGTTTGAAG 
CAGATTTGGA GGTCTAATAG ATGATGGAGT 
AAGAGTGGAT TTTGACAGAG GAAGAAAACT 
CTAGATTCAT GGGAAATGTT GCTGAAAAGG 
CATTTTCCTT GAGAAGGGAT ACAGTTACCG 
TGGAGGACCA AAACCAACTG GAGTGGGGTG 
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362851 AGGAGAAAAA ATAGAAATTA GGGAGTGTCA ACAGATAGCA AAGACAGCTT 
362901 TTTTTTTTTT TTTTTTTTAA GAATTTTGCT TAGAAGTAGA GAAAAGTGCT 
362951 ATTTGTTGGA TCTCTAGTTG GAGTCAAGAG TTAAAATAAA TTATAGAACA 
363001 ATATGTAATC ATAAAGAGGT GAGAAAAGTG TGGATGAATG CTGTGGCTTC 
363051 CTTTTTCTCT CTAGTAGTAG TCCCTAGTTG GTCTGCAATG AACTTTTGCA 
363101 AGAACGAGAG TGGGGAAGAC TCTTCATGAG GCTGATAGGC TGTAGCTCTC 
363151 ATGGTTTGTC TTTAACCCAC AGGTAGTGAG CAAGCCATTT GCTCTCAGGG 
363201 AAAATGTGGC TTTCCTGACA GCCCCAGATG ATCCATCTGT CTCTGCAGCC 
363251 TATAGGTGGG GCATATTGGA AACAAAGCAT TTAATAAAAT GTAATAGTTG 
363301 TGACTCGGGA CATAGAAGAG GCAGATTGAA AGAGAAGTGG AGACCAACAC 
363351 GCCATCTTTC CACATAAATG GCTTCAGAAA GCTGCAGACT CAGAGTGGAT 
363401 TTGTTCGTGC AATCCTCACC TCTCCTACTA GTTAACAAGT CGGCAAGCCA 
363451 GCAGTGAGGG TTTGGTTTGT ACCCTCAGTT ATTAACCATT AGTATTTGTG 
363501 TCATGTGCTT TTGCCTGGGA AGGTGTCCAG AGATTGGAGA GTAGTGTTCA 
363551 GTCTTCAGTG GGTAGAAATT GATCAACTGT AGAAAGAAAT AAGTTATCAT 
363601 GGATATGTAC ATCAGCTGTG TCTTGTTTCT AATGGCTCGG CTCGCGACTA 
363651 GCTGAAACAA TGAAAGGAAA AAGTGTGGGT AATTTTCAAG TAATTGATGT 
363701 ATTTGGGGCT CATGTAGGCC ACAATCATGA TAGTCACCTA ATTTGATCCT 
363751 TATCTCAGAG CCAAATCTTC TCACAGAGTT GTTGACATGG GATGTATTTT 
363801 GATCTTGATT TTGGAGTCTT GAATTCTTTT TCCAAATGTT CATTCTTCTC 
363851 CCTCTGTATG TCCCTCAGCA CTAGATTTTA TCATCAGGCA CAGAACATCT 
363901 TTCTGAGTAA TCTCTTCCAT ACTCCAGGTG GCGAATTAAA AACTATGTGA 
363951 GGATGACTGG AAGTAAAACA TTTAGGCAGT ACTTGCTGCT CTGGAGACAT 
364001 TCAAAGAACA ATAAACACTT CTAATGTAGT ATTTACTGAC CTCCAGAAGT 
364051 GATAGCTGAT AGTGAAGACA GAGTGCCCGG GCGCTCCAGG CTATTGCTCC 
364101 ATGTTGAGAT TGGCTGAGAG GTCTTCGGAA GTCCTCAGAT TGAATGGTAG 
364151 TTTCACTTTA AACCAAGCAA AAATTGTCAT CAAAATTCAT GACTCTCTCC 
364201 TACCCCAGCT TTCCTCCAAC TAATTGGGCT CTCTTTCTGG TCTTGAATGC 
364251 ATCGAGTTTT TATTTGTCTC AGTGCCTTTG CACTTTCTGC TCTCTCTACT 
364301 ACAAAGCCCT TCCCAGGGCC CTACACATAG TTGGTTAGCA CATTCTCGTC 
364351 TTTCTGGCAT CAACTTAAAT ACCGTCCCCA TAAGGAGATA GTCTTTGAAC 
364401 AATCAATTTA AAGTAATCCA TCCATGCCTT TTATTTTCTT TATGACACTT 
364451 ATCTCAACTT TGTTCATTTT GTGTCTCTTC CATCAAACAA AAAGCTCCTC 
364501 TAAAGCAGAA ATTTTCATCC TTCCTGCTCT ATGCTGAGTG CCCAGCCCAA 
364551 TGGCAGACAC AGCTGACACT CAGTAAATAT TTGTTAAACA AAATGGCAAG 
364601 TGTGAGTGCT GTAGAGTATT ATATGTTTAA ATATTAATTT CAAAGATTTT 
364651 TAAGACTTTT GTTGACTTCT TCAACACATT CTCCTCCCAT GCTATAAGCA 
364701 TCTGGGGGCT CTTTTCCACC TCTTCCCTCT CCCATCACAA TGTCTGACAC 
364751 ATAACAGATA CTAAACAAAT ATTTGTGGTA TCCATGAGTG ATTCCAATTA 
364801 ATTTGTATTG GGTATTTCAA TATAGTCATA AGTACATCTC CATGTCCAAG 
364851 TATATCTCCA TGCCCAAGTA TATCTCCATG TCCGAGTACA TCCCCATGTC 
364901 CAAATACATC TCCATGTCGA CATACGTGTC CATGTACAAG TACATTCAAG 
364951 TACATCCCCA TGTCCCACAT ACACCTGAGG ATACCGAAGC TTAGTGTGGT 
365001 GGGCAGAATA ATGCCCCTTC CTCATAGATG TCTATACCCT AGTCCCTGGG 
365051 ACCTGTGGAC ATTGCCTTGT ATGGAAAAGG GATTTTGCAG ATGTGACTCA 
365101 GGTAAGGATC CTGAAATGGG GGGATTATCC TGGATTGTAC TGTTAGGTAC 
365151 AATATAATCA CAAGAGTCCC CGCATGAGGG AAGCAGGAAG GTCAGAACCT 
365201 GAGAAGAGGC AGAGGTCAGA GTGATGCAGT TGCTCCCTGG AGGACACAAG 
365251 TGAAAGAATG TGAACAACCC CTAGAAGTCA GAAAAGGCAA AGAAATGAAT 
365301 TCCCCCTTAG AGATTCCAGA AGGAATGAAA TCTTAGTGAC ATTTTGATTT 
365351 TAGCCCACTG AAATCCATTT TGGACTTCTG ACCTTCAGAG CTGTAGATAA 
365401 TACATTTGTG GCATTTTAAG CACTAATTTT GTGGTAATTT GTCACATAAG 
365451 CAACAGGAAA CTCATACACT TAGTTCCTCA CATTGAACCC TACCCACTCT 
365501 AGCTCTCTCG CCATCCCTGT TCCCTATTAG CTAGATTCCC CAATTCACAC " 
365551 ACCTGGAATC TCCAGAGAGG GAGTTCTAGA GTGTGGGACT CTCCCACCAA 
365601 TCTGTGTGGA TCATGAGTAG GTAGATCAGA GATCCATGGA AGGCCATCAA 
365651 GACCATCAGT GGTTGGAGTA GGTCATGAAC GTGATAGGGC CTTGAGGGTG 
365701 AGTAGTTACT CCTCTCCCAA TTTGGTAAAG AATAAACTTC TTCCAGGAAA 
365751 ATCACAGATA TTGTCATAGA CTGCAAGAAG AAAAAAAGCA AAAAAAAAAA 
365801 AAAAAAAAAA CAAAGGAAGG AAAGAAGAAG GAAAGGACAG AGAGAATTAT 
365851 GACTGTGTTG TGTCTTTCTC TGATACACAC TCCTGGAGCT TCTCCAAGCC 
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365901 TGTACTTGCC TTCTATGCCA TATGCAAATG CTGTAGAATG ATTTCTTTCA 
365951 GTGCTAACTT TATTTAATTC TATCATCATG GTTTTTACTT TAGGAAATTT 
366001 TGGTTCCAAT AGTTTAAGAA TAACATGGAA CTGACCTTAG AAATTGTCTA 
366051 ATCCATATTG TGACTGGCAT TATGAAAAAG CTATTATTGG TGAACCATGG 
366101 ACTGCCCTGT GGGAGTTGGG AGGAGAAGGA GGGCAGAGGG GCAGAGTGGA 
366151 AGAGGAGAAG CATTCACAGA TGAATTCAAG GTCTTTGCAT AAATGTGAGG 
366201 TTTCTTACCA CTATAATTTT TTTTAAATAA AATTAAAGAA GCAAGCAAGC 
366251 AAACAAAACA AAGAGAAGAT GGTTCCTGAG GACAGGTGTT TTATCTAAGA 
366301 CATGTTGGAT GCCCAGGATC TCCTTATTAG GACTTAGGAT ATTAGGCCAC 
366351 AGAAACCAAA GATTTGTGAG GGAGAAGTTG GGGACCAGTG TCAGGTAGTC 
366401 AGGCCTTTGC TGGTAGGAAG ATGGGGCTTA CATACCCTAT AGCCACACAG 
366451 GCTTGGCACT GGTCCTGGGT GAGGAGAGAG TGGGGCCCAG GTATAGGCCA 
366501 GCCAGGCTTC CCCAGATGAT GCGTCTGTAC TTTCCAGTCT CACTCTGCTC 
366551 CCCTCACCAC CTCTCCACAC AAGCCTCAGT TCTCAGTCAC TAAGCAGCAT 
366601 TGAGAGGCAA TTACAGGGTG CAGAACTCTG CCCGGGCTCC TGGAAGAAAA 
366651 GAATATGTGC CATACTGTGC TAATGCAGGC TCTTCTATCA TGCCTCTAAT 
366701 AAATGGGTTC AATAAATAAA TGGGTTCTAT AAATGGGTCC ACAAGCGTAT 
366751 GTTTGATTCA CTATACTTCA TGTTGTCATT TTCGCTGGCT CCTTGGGGAG 
366801 AAGTCTCATG CTTCAGGGAA AGGCACGATT CACCAGAAGC CCCACCTCAA 
366851 ACTTCCTTTT TGTTTTTGTT TTTAAAAATG AACTCTTGTG TTCTCACAGC 
366901 TCCATCCAGC AGGAATCCTA AGGAAGCTCT GAGGAGAGCC ACAGCACCCC 
366951 CTTTCTCCTG CCTGGGTGTC TCCTGTCCCT CCACACCAGC TCCGTCCGCT 
367001 CCTTTAGTTT GGAATATTTC CTTAACAGAC CTGATAGCTC TGCCTCCCTA 
367051 CTCCCTCCTA AGCCCTCATC CGCAATCCAC ATGTCTGACC TCTCCAAATA 
367101 GATTACTAGT TCAGTTGGTT GCTGGACATC TTAGCTCTCT TGCTATCCAT 
3 67151 ATGATATTGG ACAAGGAGAA AGACAGGGTT TTTCTTTGTT TTTTGTTTAA 
3 67201 GGAGAAAAGT GATATAAATG CCCCTAGTCC TTGTCAAGTT TTATGTTATT 
367251 ATCCTTTCCT CAAAATAGAG CACACATGCA CACACCACAT ACACATACCA 
3 67301 CATACATCTG CACACATGTG CACATAAACA CCCACACACG TACACACCAC 
3 67351 ATACACATAC ACACACGACA CACCACAAAT ACATGTGCAC ACATATACAG 
3 67401 AAAGTTTTTG CTGTAGATAT GTCATTCCTT TAGCAACTAT TTTACACTTT 
367451 TTGGAGTCTT CTCTCACTAT TAGAAAGAGC TTTGCGGTTT ACTTTTAGAG 
367501 AAAAAATCCA CCTCCTCCTT TAGCATCTAG TCATCAGTTG GGAGACAGTT 
367551 TTTTTTTTTC CCAATTAGAA GGAAAAAAAT TCCAAGTCTA TTTTAGAAAA 
367601 GAAATAAATA TTCTCTTTCT CAAAATGCAT TTTCAGCCAA AATATGCCAT 
367651 CTGGGGGATG TTAGAAATTT CAGTCAGTTT ATCTTTATAA ACTATTGAAG 
3 67701 CTGCACCTGT TATAAATGGC CTCTGACACA AAACATTCTA GAAACTTCGA 
367751 AAACAACATT GAACATGACT TCTTGACTAC AGATTTAAAC TTATTTCATC 
3 67801 ATACGTGCTT CACAGACAGT GAGCTATAAG GTGTTTAATT AGGGCACAGC 
3 67851 AGTCATTTAA TTTAGCAGTT AAATAACTTC AACTAAGATA ACGGGTTAAT 
367901 TTTGAGGAGA GTGATAAACT CCACTCCCTA GGTCTTGTTT TATTGGACTA 
367951 TGGGGGTTAG CCCTTAGCAC TGGATTGGCT TGGGAATTTA TTTACTGATG 
368001 AATTCAACCA ATATTACAGG TACTTTTTTA AAGGCAGTCA GCTGAAAGTT 
368051 GGAGGAAATT CTGTTTCAGA TTCTAAGAAA ATAAACAAAT CTGCTCACCT 
368101 TCCACTCTAT GCTACCTTTT CTATTTTGAG AGAATAAACC TCTAAATCTT 
3 68151 TAGACTTTTA TGAAGAACAC TTACCATCAT TTGAATGTCT CTCTTCATTT 
3 68201 CCAGTCTATT CTGAAAGTAT CTTCCTCCCA CCACTAGGCT TTAGTAAATA 
3 68251 TTTATAACCC AATCATCTCA AGTCCATTGG ACTCCATCAC TGGTATGTTT 
368301 CAATTATTTA AAGTCAGAAT AGCATCTTCT GGGCAGCTAA TGACGTCTCA 
368351 ATAGATATAC ATTATCGAAA CATATTGTTA TTTACTTAAT AAATCAAGCC 
368401 TGTTGGGGAA AGTAGATGCA ATTCAAGTAC CAGAAATGAG GAATAATGAT 
368451 GGCATAGATC ATTTCTGAAT CACTCTAGCC CATTTTACAG CCTGTAAAGT 
3 68501 ACACTGAATT CCATAAAAGT GACAAATGGT ACAACATTGT AATAAATAGC 
368551 TATAGTTTGT GCATTGTGAA TTGGAGCTTA GATTCATATT GAAACTCTCT 
368601 GTCAAAAAAG AAGAAGATGA ATTAAGTATT AAAATCATAC TCTGTAAAAG 
368651 GAATTATATA TTTTCCATTT ATCTTGATTT ATTATATGAC ATCATGTGCT 
3 68701 TAAGTATATG TAACCTTTCC AAATTGAGTT GATTTAATGT GAAGGGAAAA 
3 68751 AATGTTAAAA AAGAAAATTT GGTGTATGTT TAGAATTTTC TTTATTTTCC 
368801 TGCTATATTT TTATAAAACT TTTAACTGTT CCATGGAAGC TTAATCATAT 
368851 TACATTTTTT AAGTTGAATT CCCAGAAATA TGCAATAACA TCTACAGGGC 
368901 TTATTAAGAA CAAATGTGCT AAATCACAAT TAATATGACT GTTTTTCATT 



WO 02/086113 




122/219 



CT/GB02/01887 



368951 CAAATTTCAA CGTTTGCATA TTAAGAACAC ATAAAACTGG CTGGGCGCAG 
369001 TGGCTCACGC CTGTAATCCC AGTACTTTGG GAGGCCAAGG CGGGCGGATC 
369051 ATAAGGTCAA GAGATGGAGA CCATCCTGGC CAACATGGTG AAACCCCATC 
369101 TTTACTAAAA ACACAAAAAT TAGCTGGACA TGGTGGCCGG CACCTGTAGT 
369151 CCTAGCTACT TGGAAGGCTG AGACAAGAGA ATCACTTGAA CCTGGGAGGC 
369201 AGAGGTTGCA GTGCACTGAG ATGGTGCCAC TGCACTCCAG CCTGGTGACA 
3 69251 GAGCAAGACT CCATCTCAAA AAAAAAAAAA AAATTCCAAA ACTTTCAGTG 
369301 AAATATTAGC TGTATGAAGA GAGTGACTAC GCTGAAGGGT GAATCTAGAC 
369351 TTCCAAAATG ACAGACCGAG GAATGGGCAT GCCTTGAATT TTTCCAAATT 
369401 TCCTGGAATT TAAGTGATAC ATCTACCGTT CTATTTATCA TTGAAAAATA 
369451 ATTCTTAGAG GAATAAGCAC ATGTAAACAA TATTCTATTG ACATCTGCTA 
369501 TTTACTCCCT CAAGTTAACA ATACAGCCAC AAAATGATTA AGATGAACAA 
369551 ACCCAACTCT TACTGCATTG CTTCCTAAGG AGATGAAGTG GTTTTTCTTA 
3 69601 GGCATGTGCA TTGGCTCCAT TTTTTTTAAA GTTGGTTTGC AGCACACTTC 
369651 AGGAGGCCAG AAGCTAAACA AGCAAGTGTG AGCGTTAGCC TGTAGGAAAG 
369701 ACAAAGGCAC AAAGTGAGGA AAAGAAATGT GTCTTGGTCA CCCTCACCTT 
369751 TCCCGGTGTC CAACTGACAT TACACCATCT CTCTGTCCTC TGCTCATCTG 
369801 CCTCTTGCAT GTTGCATTCT TGAAGCTTTC CTCTGTCATT TTGCTAATTT 
369851 CTGGGATGGA GCTCGCAGCT CACCCGCTGG GATAGCAGGA GAATGGGTGT 
3 69901 GAAGTATCAA GGGCTGGGTT GGCCACAATG TCCTCCTTCG CCTTCCAGCA 
369951 ATCGATTCCT TTGTGCTCCA TTTTTTCCAA GTGTATCAAA TGGATCCAAC 
370001 GCTGTCCTGT GCGGAGTAAA AGAGGTATTC TAATGATATT CTAATGGTAT 
370051 TCAAATGATA TTCTAATGGG AGGTGTAATT TTATGAATAC TTCTTGCATC 
370101 TCATCTCCAC AATTAAAGCA TAAATTTTTT GGTTTGAATT CTGTCAACAG 
370151 AGACTATTCA CTAGGCCCCT TCTCTACACA AGGCTAAGTT GCTATATATG 
370201 TCCTATTTTA TATAATTATC TAACCAAGCA GAAACGTAAA TATATATCTT 
370251 CATTTAGCTT ATGAAGGAAA TAATTTCCAA GCTCACTAAG ACCATAGCAG 
370301 AGGAGAAAGA CTCAAGACCA TGGTTGTCTG ATGGCAAAGC ACCTACTCTT 
370351 CCCCCTATTT CATATTGATT CTCATTAACA ACTAAGGTTA TTTAGAAACT 
370401 ATATATTGCA TGATATAAGG TACCCGGCAG ATCACAGGCC TTCATAAAAG 
370451 TTTTAAGTTA ATAAATCAAC CACTGAGGGA AAAGGTTTTT TCTCTGAAGT 
370501 CCTGGGTTTT CCACTGGTAC CAAGGTATTA ATAGTATCTT ATACATAGTA 
370551 GACATAAGGG AAAGGAAAAT AGAGGAGAAA AGGAAAAAAA ATTGGGGAGA 
370601 AATGAGGTAG ATGAGTGAAG GAAGATGAAT GAAAAGAAAG ACAAAAGGCA 
370651 AAGCAGGGAA GGAGAAAGAA TATCTCAGGA ACGATTTCTG TTCCAACCAT 
370701 TGATAAACTG GTACAAGGAG TCTACTGCTC TCAGGCTGTG GTGCACTGTT 
370751 TGTTTTGTTT TTGAAATTAT AATGAAAGAA GAGAGAGATT TAAAGATCTT 
370801 CACTCACCAG AAAAGTTTTG TCTCCTTTCA TCCATTAATA ATTTTATTTA 
370851 ATTTCAATCA CTTAGAGTGG TGTGGACTAA GGATGCTCTT ATTGCCTTCA 
370901 GGGAACACTG CAGAGGCCTG GAATGAGTTA TATAAGCAAT ATACAAGCAA 
370951 ATGTCAAGTG TAAGCCTCCT CTAAGCTCAG TGGGGCCTCG TGATGGAGGT 
371001 TATAGCAAGA ATGGCTGTGA GCCTCTGGTT CAAGAAGGGC TAATCCTGGC 
371051 TGTGTTCCTG TTCTGCCAAG ACAGGAACTC TGTCACTATC TCCTTAGCCT 
371101 TAGTGGACTA GAGCATTAGC TCTTGTATTT TCTTTTGCAC AACAGCTGCT 
371151 GGGCTTCTCT TTTAGGCCTC AAAATGCCTG GTATAGTGCG TGACATGAAA 
371201 TAAGTCCTCA GCAAATACCT ATTGGATTGG ATTGATTAAG TGGGAAAGAA 
371251 TGGAGTAATG GCTTTGTCTT ACCACATTGC CCACCTGATC TGTTCTTTCT 
371301 TTTTGTCCCA AGCACTTAAA GCATTCTTAT TGAGTCATGG TGACAAGCTA 
371351 CTCTCTGGAT AAGGCTTCTT GAAATGGATT TTAGTCAAGC TTAGCACCAA 
371401 CGGCCATTAG GAAGCCAACA ACACAACACA TTTCTCCTCT GTACTTCGCA 
371451 GATTATTTAG TGAACTCAGC CATGCTTCCC TTGGAGGAGT GAAAACAGAC 
371501 ATGGAAACTG GGGTGAGGCA CACAGCGCAG CATACTTCTG TCCTGGTGTG 
371551 GACCTCTGCA TGAAGAAATG TGATGTGCTA AATAAATTCC TATCCCCAGG 
371601 AGGCTTGTCT TCTTATCTCA GACCTGCCAC CTATAAACTA CGTGACTTCC 
371651 TGGTGTTATA ATTTACTCTT CTTGTGCCAC AGATTCCTTG TATTTAAAGT 
371701 GGATATTTGC TAAGTGTCTG CTATGGGTCA GGTGCAGTGC TAGGCACACA 
371751 TGGACACTTA ACTCATAGAA TAGATTCATT CATTCATTCA TTCATTCATT 
371801 CAATGTATAT TATTAAGTAC CTCCCTCTGT GTTCCAGACA CTATTCTGGT 
371851 TACTAGAAAT ACTGCAATGA ACAAAACAGA TAAAAATAAA ATATCTTTAC 
371901 CCTTATGGGG GTGGATGTCT ATAATTCAGA CATATGAGAA ATGGACTCTG 
371951 TGTTCAGTAG CCATGAATAC TAAAGAGAAA GAGAAAGCAG GCGAGGGAGA 
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372001 CACAATTTTC AGGGGTTCAG GTTCAATTAT CAGAAGCGTT TTTTCCTTAT 
372051 CTCTTGTCTT TAGATAGCCT GCTTGGGAGA CAATTCACTG AGCAAACCAT 
372101 GCCACTGTCA GGAAGAACAT TCTGGGCTCA GGGACCAGCA AGAGCAAAGG 
372151 AGCTCAGCCA TATTAACGAC TGTGCCAGGA TATTTTACTA CACAGTGAAA 
372201 AACACTGTTT AAAATAATGA GGCATTATCA CAGAAAAAGA AATGATCACA 
372251 TTTAAAAAAA TAAATTTGTA TGTAGAAATA ATTTTATATG TGCAAGAAAG 
372301 TTGCAAAGAG AGTGCAGAGA GTTCTCACAT GCTCCTCGAC ATCTTCTTTT 
372351 CATTAGCATT TTGCGTTATC ATGGTGTATT TGTCAAAACG GAGAAACCAA 
372401 CTAAATTATG ATCCTGCTGG TTTCTCCACT AATGTCCTTT ATCTGTTCCA 
372451 AGATGTAATC CATAATACAA AATTATATCC AGGAAAGTCC ATTTTAAACT 
372501 TAGGTGTTAA GTTTTATTTC TAATTCTGAT TGAATGCTAT TTCCGTGTAC 
372551 CAGGTAGCAG TTGAAAGGCA GTTGGCAAAT ACTTGTTAGG CCACCAAGTC 
372601 TCCCTTAAGT GAGTCACACA CAGCAGGAAG AACACCAGGC AGGTGGTCTA 
372651 GAGTCCTGGG TCTTCATCCC AGCTCTTCCT CAACCAAACA TGTGTCCTTG 
372701 GACAAGTTTT CCTCTGGAGC TGAGGCAGAT TTCTTCTGTA CAAAATTTAG 
372751 AAATTTAGGT TAGATTTGTA AGATCCCTCC AACCTCTTAA ATTCTGCCCT 
372801 GGCATTATCA TGGCATCAGT ATCTTCATAC TCAGAAGCAT CAAAAATTAT 
372851 GTGACTGATT TCCCTAATGC CAAACGTTGG TTCATTACAG CACCAGAGTG 
372901 GGCTGAGAGA GCTTCAGCAC AACCCTTGCC TCTGACTACC ATACTCCACT 
372951 CCAGGCTCCC TCCTTCCTAT CTCCCATCCT CCAACAGCTG GCTCCAGACA 
373001 GTAGGTGTTA ACGGCTGAGA TCACTAAATG AATAGGAACT ATTCTTAGTA 
373051 TTTTACTTGT ATTTCTTATA AACACCCTTA TGAGGCAGGT ACAATTTTTT 
373101 TGGCTTATCT ATCTTGGTAG ACAGAATAAT AGCCCCCTAA AGATGACCTT 
373151 GTTGAAATCC TCTGAAGCTG TGAGTATGCT GTCTTACAAA TCTCACATCT 
373201 GCAAAGGGAT TTTGCAGATG TGATCAAAGT AAGGATCTTT TTTCCTTAGT 
373251 TGGAGATTAG AATATGGGGA GATTATCCTG GGGAGATAGG GAGTTATTCT 
373301 GGTGGGACTG ATATAATCAT AAGGGTTGTT TTAAGAGGGA GGCAAGAAGT 
373351 CAGAGAGCAG AGGAGAGGCT ACATTGCTGA TTTTGAAGAT GAAGCAAGGG 
373401 GCCATAAGCC AAGGAACGTA TGAGGCATCT AGAAGCTGGA AAAGGTAAGG 
373451 AAACAGATTC TTTCCTAGAG TCACCAGCCT GTGACATAAT AAAATTACAT 
373501 TTTTTAAAGC TACTACATTT GTAGTAGCAT ATTGGCATAT TGGTTTCTAC 
373551 ATTAGTAGTA ATTTGTTACA ACCTCAATAA GAAATCAATA TGCCCATGAA 
373601 AGTTAAAAGA ATGTGTCCAG AGCTGGAAGG CATTATCCTC AGCAAACTAA 
373651 GGTGGTAACA GAAAACCAAA CACTGCATGT TTTCCCTCAG AATGGGCACT 
373701 GAACAATGAG AACTCACGGA CACAGGAACG GGAACAACAC ACACTGGGGC 
373751 CTGTGGGAGG TCGGGGACTG GAGGAAGGGA GAACATCAGG AAAAATAGCT 
373801 AATGCATGCT GGGCTTAATA CTTAAGTGAT GGGTTGATCG * GTGCAGCAAA 
373851 CCACTATGAC AGATGCTTAC CTATGTAACA ACCTGTACAT CCTGCACATG 
3 73901 TACCCTGGAA CTTAAAATTA AAAAAAAAGA GAGAGAGAGA TACTGTAAAA 
373951 AAAAAAAAAA AAAAGTGTCC AAAATTGGGT AACTGGTAAG TGGTTCAGTC 
374001 TAGATTTGAG CATAGGCCAT ACTGCTTCAC TAATGTAATG AAAAGGCCTT 
374051 CCCCATAGTT TTTTTTCTTT TATTGATACA CACTATTTTA TATATTTATG 
374101 GGGGCACATG TGAGTGTTGC ATGCATAGAA TGTATGATGA TCAAGTCAGG 
374151 GTATTTGGGG TATCTACCAC TATGAATATT TATCATTTCT ATATGTTGGT 
374201 ATAAAGTTCT AACTTCTGGT TACTTTGAAA TAATCAAAAT ACGATTGCTG 
374251 AGTATAGTCA CTCGTGAATG GAACTGGAGG TCATTATGTT AAGTAAAATA 
374301 AGCCAGGTCT ATAGGCTTTT TGTGATGCTT GAACGAAGTA CACAGAGAAT 
374351 GCAATGCTTA GAACACACAA ATATCATTGA GTTCTGTAGT TACCACAACA 
374401 ATTTGTGGCT CTAGTCCACG ATTTCTCAAT CCTGTCACAC TCACCCACCC 
374451 ATCCCATTAT CGCCTCAATC CCGAGCCTCT GTCCTCGGAT CTGACCTGTT 
374501 TGACTTTTGT AAGACGACAA TCAGAATTAC AGTGTCAAAA ATGACCTCAT 
374551 GCAATTGTAC AGCAACTTAT GCTATTCACC TTGCATAAGT AAACGCGATC 
374601 ^ ATATTTAGTT TTGAGTCAAA CATCCTTGGG AATGAGTTAA AGAATGATTC 
374651 GCCTTTGCTG TAGGTGTACC TATCCCTCTC TCTTGCTCTT TCTGTTCATT 
374701 TGCCTTCTGC ATTCTACCTA TTCTTACATG TTTGTCTCTG TATCACCAAA 
374751 TCTATTTTGG TCCCTTCACT TCAGCCTATT TCTTGTTTTT GTAGCAGTCC 
374801 AGAACAAAGC TACAATTTAT TATTTAATAC ATAAATAGAA TTTATGTGTT 
374851 TTTAATATAA AGATCAGAAC TGTAGGAGGT TTTATATTTT CCTTAAAACT 
374901 CCAGGCAAAC AAGATTTTTG ACTAAATTGC TTTTTTATAT ATGATTCTGC 
374951 TTTTTCTCAT CTTATATGTT AATAGTCTTG GGTATGAAAA GTACTTAAAT 
375001 TTAGAGACAT GGTAATTTAG AGACAGAGGC CTGGTTTAGA TATGCTGACT 
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375051 TTTTAACACT GAGAAAGTCT CTTAGAGTCC CTGAAGATGA AAGTGTAGAT 
375101 TTTAGTTACT AATTCACCTC TTGTGCTTGT TATCCATATG AGGAGCTGGT 
375151 AAGGGACAAA TCCTGCAAGA GTGAGAAAAA GCACCCATTT TTQTATTTGC 
375201 TCTACATTTG GGCTTAAATT GCTGCTAAGT CATTCCCCGA GGGAAATCTG 
375251 GAGTGAGTGC ATGTTACAAA TAAACACTCC CAACTTTGCT TTCCACAGCC 
375301 TCAGCATTTT CAGCTCCATA AGAGACGCTC TGAACAAGCA AATGTGCTTT 
375351 TAACTTAGCT CTAAAATATG TCCTGATAGG TGGGAGCTAT TTTTAAGACA 
375401 ACAGCAAGGA CCCGGCAGAC TGATATTTGC TGGAACCCAA ATTGCCTCAC 
375451 ATCTTCTCTG ACTTTGCCCA ACAATGAGGC CCATCAGGTT ATTATTGTCC 
375501 CCTGAGAAAA TATATTTTGC ATTGCACTGT ATGTTACTGC TCCCCTTAAA 
375551 GATGTTGGAC ATTTTTAAGC TGCGTAATGA GGAAGAGAAA GTGGGTGGCA 
375601 GGATAAAGAA GTGGGAAAAT GGTAAAGAGT CCTGCATGAG AAACAAAGAA 
375651 ATCTTTGCAA AGTGAAACAA GAGCAAAAGA GTAAGATAAT TTAAACTAGA 
375701 GTGATAAATC ACACTCACAC ACACACACAC GTGTGCACGC CTATCAACCT 
375751 ATCAGCCATC TGTTAATCAA TTATTTATTA ATTATCTATG TAGCCATCTA 
375801 TGTCAACTGT CAATCAATGT ATCATTTATT TATCTGTCAA TCTATCATCT 
375851 ACCATGTAGG AGGCAGGAGG GAGAATGCTA TATTTGAACT AAAGCAGAAT 
375901 AAAATAGTAA AACTAGTCTC TGTGTTTCTT TTGTCAAAGA GGGTGAATAT 
375951 GCCATGAAAG TCTAATGGGG GATGAGAAGT TTGTCTCCTT CTCTTCTTTC 
376001 CTCTCAGCAG CACCCCCTTT CCCACTTCAA GTCCTCCTCA AGGCCAGGTG 
376051 CTATATATTG AAAACAGATT TAATAAAAAC TATTCAAAAT ATTCCTGAGA 
376101 GACCAGGTTT GTCACCAAAG ACTTCCCAAA TGATAACAAA TCATTTTGTT 
376151 ATCATATTCT GAAAAGCAGA AAATGTGGAC TGGAAGCCAG GAAACCTGGA 
376201 GTGTAGTCTG GGTACTATAT CTCACTCAAA GATAAATGGG AAAGGGCAGT 
376251 TCTCCAACTT CAATATCACC GTGGGGCCCC CAAATCCAAC CAACATATCT 
376301 AGAAGCTGTC TGACTAGATG TTCGAATATA ACTCAAAACA ATCCAACATA 
376351 AGATCCCTTA GGTAGGAGCT TTGGCAATGT GTCCACAGTA GGCAGAATGA 
•376401 TGGGGGAACT TGAAATTATG ATATGAAAAG TAGATGAAGG TGGCCGGGCA 
376451 TGGTGGCTCA CGCCTGTAAT CCCAGCATGT TGGGAGGCCA AGGCAGGTGG 
376501 ATCACGAGGT CAGGAGATCG AGACCATCCT GGCTAATATG GTGAAACCCC 
376551 GTCTCTACTA AAAAAATACA AAAAAAAAAA ATTAGCCGGG CGTGGTGGCG 
376601 GGCGCCTGTA GTCCCAGCTA CTCAGAAGGC TGAGGCAGGA GAATGACATG 
376651 AACCCGGGAG GCGGAGCTTG CAGTGAGCAG AGCTTGCACC ACTGCACTCC 
376701 AGCCTGGGTG ACAGAGTGAG ACTCTGTCTC AAATAAAAAA AAAAAAAAAA 
376751 AAAAAAAAGA AAAGTAGATG AAGGAATTAG AGATGTTTAT CCTGGAATAG . 
376801 AAAAAATTCA TGGATGGACT ATATAAACTC TGTGAAAGGC TGTCATGCAC 
376851 AGGAGGAATT TGATTTGTCC TATTCTATTG TAAACTGGTG GTTCTGACAC 
376901 GGCTTCCAGG GACTCATCTA CTGGATCATA GTTAGACCCA TAAGTAAAAA 
376951 TGACTGCTAG CTGTATACTG CTTAAATAAG AACAGTTTGG GGTGATACTT 
377001 GATTTTGTTT GGATTCGTGC CCCCACCCAA ATCTCATGTT GAAGTGTAAT 
377051 CTCCAGTGCT GGAGACGGGG CGTACTGGGA GGTGATGGGA TCATGGAGGT 
377101 GGTGTCTAAT GGGTTAGCAC CATCCTCCTA GTGCTGTTCT CCTGAGAGTG 
377151 AGTGAATGAG TGAGTTATCG TGAGATCTGG CTGTTTAAAA GTGTGGTGCC 
377201 TCCTCACTCT CTCTTCCTCC TGCTCCTGCC GTGTGAAGTG CTGGGCTTCC 
377251 CTTTTTGCCT TCCACCATGA TTGTAAGTTT CTTGAGCCTT CACAGAAGCC 
377301 AAGCCAAATC CCCTCATGCT TCCTGGACAG CCTGTGGAAC CGTGAGCCAA 
377351 TTCAACCTCT TTCTATAAAT CACCCAGTCT CGCATATTTC TTTATATCAA 
377401 TGAGAGAACA GACTAATACC TTTAACATAC GTGTAATGCC ATCCCCTTTC 
377451 TCTCATGATA TGCCACTTCA TAGAGGGCAG ACGCCACCGC TAAAACTTAT 
377501 GGAGGGAACC AGAAAGTGGT AGGGTACGTC TCTACAGAAA AATGTGCTTT 
377551 CTATTGATGA AAATTCTGAC AATTAATTGA ATTGCTGTGG TAGGTAGTAA 
377601 ACTCTGTCAC TTAAGGTATT CAAGCAGAGG TTTGAATGAA CTCTCTGTCA 
377 651 CTCAAGGTAT TCAAGCAGAG GTTATTAATT AATATAATAC AATATTTATA 
377701 CTACATATAT TGTGTATATA TTCTGTATAT GTAGTATATA ATATTTATAC 
377751 TACATATATT GTGTATATAT TGTATATATA TAGTATATAA TACAATATTT 
377801 ATACTATATA TTGTGTACAT ATTGTGTACA TGTAGTATAT AATATTTATA 
377851 CTACATATAT TGTGTATATA TTATGTATAT GTAGTATATA ATACAATATT 
377901 TATACCACAT ATATTGTGTA TATGTAGTAC ATATATAATA TATACAGTAT 
377951 ACATATTGTA TTATATTAAT GAATAACCAC TACCATCCAA ATCTGTCTCT 
378001 CACATACACA AACATGTATG CACAAGGAAG AAAATAAACA TCATGAAAGG 
378051 GCAGGGATCC TTTCTCATTA TAAAAATAAC ATCATCTTTT GTAAACAATT 
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378101 TAACTTTCAA AAGATAATAC AGTTGGAAGT AACCAAATTG TGACTCTATA 
378151 GTTAAATACT TTGAATTGAT CTAATAATAT AGAATGTAAG GTGTTACCTA 
378201 GATGAGTATG AAAACAGAAT GAAGCAAACT CACCAAAACA TGAAATAGGC 
378251 CCCCCAAAAT AATACAACTT GTTTATGGGA AAGAAAACAC ATTGTGAAGT 
378301 AGACGAAAAG ATCGTATCTA TCATTCTATA ATAATCACCA AAATTAATTC 
378351 CTCAAAATCT CAGAGTTGGT AGTGGCATGA AGATAAATAA ATGCCTATTA 
378401 GAACATGAAA GGTTCAGAAA TATGAGAAAC ATGATCAGAA GAGGCAGATC 
378451 TCTCCACCCT ATTCAATCTG TCCAGCTCCC GTGACATGTG CTAAATGAGA 
378501 ACTATTGTGT GAGCTTTCAG GGAAAGGAGT AAACCTACAG CGAATGATAA 
378551 AGTCTCCAGA TATTTGAGGT TAATGAAGGG GAAGGAGGAG GAAAAACAAC 
378601 ATTTGTGGAG AGGCTACATG TGCCAGCCAT CGTGATAGGC TTTTAAAATG 
378651 TGATATTTCT CAAAACTGGT CCATGGGGAT GACACTGTTA TCTCTTCTTA 
378701 CAGATGAAGA AACTGAAGTT GAGTGGTCAT GCTGAGATTC ATATCACAGC 
378751 TTTTAGAAGC AAAGCCTGTG CTCTTTCTAC TCTACCGTTC TCGAGAATTT 
378801 AAAGTCGTGT AGAAAACACA GGCGTGGAAC CTATAATATT TAATGGACAA 
378851 ACAAGTGTAT TTTACTTTTT CATTTAGTCT TTGTTTGTTT GTTTATTTCT 
378901 GAGACAAGGT CTTGCTGTGT TGTCCGAGAC TAGAGTGCAA TGGTGTGATC 
378951 ATAGCCCATC GCAGACTTCA CCTCCTGGGT CTAATGATCC TCCCACCTCA 
379001 GCCTTTCAAG TAGCTAGGAC TACAGGCACA TGCCACCATG CCCAGCTAAT 
379051 GTTTCCAGTT TTTGTAGAGA CAGCATCTTG CTTATGTTGT CCAGGCTGGT 
379101 CTTGAAATCC TGACCACAAG CAATATTCCC ACCTTGGCTT TCCAAAACAT 
379151 TGTGACTACA GGCAAGAGCC ATCACACCTG GTTATATTTG TGTTTCAGAA 
379201 ATTAAGAAGG TTTGAGAAAA AATAAATAAA TGAAAGATCT TCATTTGGTC 
379251 AAGTCCAAAT GAAGTCATAA CTTACAGGCA TCTTGGAGTG TGAACAAAGA 
379301 GAAAGCACCG ATAGTTCCAC ACAATCAAAA GGGGATTTGG AAAGATGATC 
379351 CTGAAAAAAA CACCTGGGCA GGGTTCTTTA TAGGTCACTT TGCAGAACAA 
379401 GAGAAGTACA AGGAAAAGAG AAAGATATGG TCCCAAGCTA TCAAACGACG 
379451 AAGCAGAAGA TAATCCAAAA GCAGGTTTCT CAGCTCAGGA AAGCAAGAAC 
379501 ATGAACAATC TCTTCAAAGG GAAATAGATC TTGGGCCAGC TCACCTAGTT 
379551 CCTGAGTTCT AGAGAGTTTT AAGGGCTGCT AGGAAAGCTG TCTTGATCTT 
379601 GTTAGGGCAC TACCCTGGGG AGAAAACACT CACATATGGA AGAAATGCAA 
379651 TAGCTAATTC TTGACTAACT GATTGATAGG ATGGGAACAA TGAAGGGAAT 
379701 GAGGGTGAGA GGCAACTAAT ATTAACTAAA AAGTAATTGC TGTCTTTTCA 
379751 TCAGGAGTCT GCTGCAAAGG TTGCCTTACT GCAAACAAAG TGGCTGAAAA 
379801 CAGATAATCC TTTGCTAGGA CATCTCCACT TTTCCAAAGG ATGGAGGCAT 
379851 CAAAGAATGA TTACTACATT TGATCGTGGT GAAAGAGGAA TCATGACTAA 
379901 TACCAAGAGA GGCTCAAGCA GATAAACTGC TGAAAGAAGT ATGGCATTTT 
379951 ATTCCAAGTG GATCAGAAAT ATCCTGTTTT ATGCAATGAT GGGGAAGAAA 
380001 ATGAAAGACT CAGGACATTT CTGGGCAAGG GACAGTGGTG TCAGTGAGAA 
380051 GGGGCAGACC ATTTTACCAC TTTATATTAC CCGCCTCATG TGACTAGACC 
380101 CAGTCTGCAA GCATCAGCAC TGTGATCCAA TAATCTGTTA CAAAATTCAT 
380151 AGGTTTTTCC CCCTCCCATT CAAAGTTCCC CTGGGACCCC CTATGGAAAG 
380201 AAATAGGAAG AGGAGAGCAA AATCTCCCTC TCTAGCAGGG TCAGCATCCC 
380251 CGCCCTGCAC AGCGTCAGCG CCGCTTGCCC ACCACTCTCC TAAAGACCCG 
380301 TCATTATGAA AGGAAAAGAG AAGCTAATGG CTGACTTGAT GGAAACATTC 
380351 TTGGCACTGT GGACAGGCCT GCTTGGAGGC CCTGTCTGCT GAGCCTTGGG 
380401 AATGTGAAGA GGAAAGGTAG CTGGTTAACA TTGTTTAACA CAGAATTAAG 
380451 TCATAATCGC ACCTGAGGCA TATTCCAGAT CTraGTCCCC CCAGCACCTG 
380501 TCACATTTGA TTTAAAGATT TCTCAAAGTG AAAAATGGTC TCCATAAGAT 
380551 AAAGACAACT CCCCAATAGC AACCGCAACA CAAAACTGCT ACTTACCACG 
380601 ATTTTCCAGG GAAACAAGGA AAAGGGCTAG ATTATACAGC AAGCATTATG 
380651 TGAGAGAATG CAACTCTTTT TCTTATACAT TTGAAAACAT TGGAGGGAGG 
380701 CAGTGAAAGT TTCTCCAAGG GCTTTACTGG GAGAACATGA AGTGCTGGGA 
380751 GACCCAGGAT TTCCATGAGT CCATTGGGCA GGGCACAGCG AGGCAGGCTG 
380801 CCTATGTGGG TTATACCTGC TCTTAAGTAA GGCAATGGAA GGTAACAGAA 
380851 GCCAAGAAAC AGGATCCATA AACTGAATAG TTCTCACTCC TCAAAGCCCA 
380901 ACTAAATAGC AACAGCAGGG AGACGAAGGG GCTTGAAGAA TAAAGATGAA 
380951 AAATGCATGC TCCATTTATT AGTATTTGAG TTTTCAAAAG TTGTATGAAA 
381001 AGATGGGAAG CAATGGGACA TTTGATGACG CTGAAAATGG AAAGTGGGGA 
381051 TTTCATCGGT AAGAGTGATT GGTGTGCAAT TTAGGAGAGA AAACCGAAGA 
381101 AGAAGGAAGC AAAAGAAAAA AATAAAGACA GGAGACAAAA GAATAGCTGG 
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381151 CATTAGAGGG GAAGGCTGGG CTGGCATGTC ACAGGATCAT GTTCCCTCTA 
381201 TGTTGAGATG TGGGTGGCTC AGTCTCTGGG GATTATATTC ATAACCTATA 
381251 TAGAAAAATA AGGCAAAGCC AATCAGAAAG GGCATTAACC TCTGGAAGAC 
381301 GAGGATGTGG ATGAACTATA TGCTATTATT GTTTGGAAGT AATGCATTTG 
381351 CACTTCAACT TTAATGACAG AGGATGACAA AAATTAGAAA GGTAGTTTCT 
381401 AGTCTTTTCC ACTATAGTAA AGGACTGATT TCTTATAGGG AAGTAATTTC 
381451 TCCTGACACA TTATATTAGG TAAAGATTGC ATCTTATATG AATTAATACA 
381501 TTTAAAAACA AAGAGAACTA TTCACCTTTT TTTGATTTGT TTCAGTTTAA 
381551 GGGGC TAAAA TCTGTTAATC CGCTACTATG TGCTGTCTCC TTTGGTTATT 
381601 TTATTTGCTC TTAACCACAA ATTTGGGGTG GTAGTCTTCT GAGATTGGTG 
381651 GAAAACTCAT GATTTTCTCA AAGTCATGGT GTTCATACAA ACACTCAGAA 
381701 ATCACCCTCA TTGGGTTCAG GGAAAGGTAG GCAATGTTTC ATTCCAAGCC 
381751 TGGCCACTGG CTGATTATCA TCCCCACCCC CTCCACTTAT TCTGCCTCCA 
381801 GACAATAAAC TTAAAGGTGA AGGCAGGGAA AGGGGAATGA AGAGTTCAGG 
381851 AATATCAGAG ATGCGTGATG AAATAATAAC AGATGAACTG GTGATTATGA 
381901 TATCAAAAGC ACCACTACAA TTGTTCAGGA TGTAAAAGAA CTAGGATAAA 
381951 GTAAAACTTC TTTATCAACA AACTGCTGGT TACCCCCAAT GTAGAGTTAT 
382001 AAAAAGGAAG AGATGGCAGT GCCAAGATGG TGGTGGTGGG ATATCAAAGT 
382051 CCTAACAGTA AATAAATGGC ACATTCAGTG AGCTTAAATT AAGAGAGTTA 
382101 TACTGAATGG ACTGTTTACA AACATGTGGG AAGAGTTAGA GGATAATTAA 
382151 ACATCCAGGG ACCAGTAATA GTGAGAAATC CAATTTTTCA AGGGGCAGAG 
3822 01 ATATTAAAAA GTGATTATCA CATTTGTTTG TGTCACTACC TGTGGTCTTG 
382251 AAGAGGGAAG AGGAGGGAAT CGTAGTACTG GAATTTAATG AGAGCTATTG 
382301 CACCAGGAGA AAGGTGGCTG GAAAGGAGCT ACATTCTTAG AGGGACACAG 
382351 TGAAAATCTG TGCCTGGCAA GAAGGAAGCC TGGGGATGGG GTGAACAAAT 
382401 ACCCCTATCT CTCTGTCTCT CTTTGATCCA ACCTTGTGGT TTTCTTACAG 
382451 GGGGACAAGG GACTTTGAGT AAACAGCCCA CAGAAATCAC CCTCTTTGGG 
382501 TACAGGACAA GGTAGGGAAG AGTGGAGAAA AAGTCTGGAG TGGCAAGCAA 
382551 ACAAACAAAC AAACAAAAAC TAGCCCACAG GACTGTGACT GAGCTAATTA 
382601 GCACACCACT CCTTCCAGGC CACCCTACAG AAGCCAACAC TAAACAAAAA 
382651 CACAGCATCT GCTCTGCAAG ACAGACTCAA GATATGTTCT TTGGCATGTC 
382701 TTTCCAACAA AATTACTGCC TCTGGGTCAT CCTGTGTAGA AATTCTGGAG 
382751 CTACCAATGA ATTGGGAAGA GTGTTTTTTT TTTGTTTGTT TTTTGTTTTT 
382801 TCATAGAAGG ATGAGTAAAG AAATTTCACT TGTTTTAGAC AAGAGGAAGG 
382851 CAAGAAGACA TAGTTAATGG GTACGGCATG GTAAAAGGAG TACAGAAATT 
382901 AAGAAGAGCA GGCAAATAGA TGTGATGGAG AAAAATATAT GGTATGTTAT 
382951 CAAGAGGAAA GAGACAGAGA ACAACGAATC ATGATTTGGG AATAAAGTTG 
383001 GCAGGACTGC AGAATGGTGA TATGTATGTT TTCTCCCTCA TCTGTGCAGC 
383051 AGCTTTCAAC ACTGAATTAG GAAAAAATGT GGATGGTGAT GTATATTACT 
383101 TTATTTACAC CTCACAGCAA CCTTGCGAGA TAGGTTATCC CCATCTTATA 
383151 AATAGGGAAG TTAAGTAATT TGTCAAAGAG CACACACTTG GTGATAGGGA 
383201 AAGCCAGGAA CCAACCTCGC TTTTGTTTGT TTGTTTGTTT GCGAGTTCCC 
383251 TTGCCAACAG GCCTTCCTGT AGAGAACATT GGGAGGACAT ATAAATTGAA 
383301 AGCAGAGTGG AGGTGGCCAG TATCACATAC CTGGGACCCA CCGGACTCAT 
383351 GGGCTTCCAA TAAGAGTAAC CAGAGGGGTC CATAGAAAGT CAGAGCTGCT 
383401 GCCATTGATG ACGAGGCCGG AGGGCTCTTT TTTGCCATAC TTTCTTAAAG 
383451 CACCATGGCC CTATGCTCTT AAACATGTCT CTATTTTTTT TTAAAGTAAC 
383501 AGGCATGACA GCAACTGTAT TGGTTTACTA GAGTTGCTGT AAACCACACT 
383551 CTGGGTGGCT TATACAACAG AAATTTGTTC TCTCACGGTT CTGGAGGCTG 
383601 CAAGTTTGAG ATCAAGGTGT TAGCCAGGTT GGTTTCTTCT GAGGGCTGTG 
383651 AGAAGGAGCC TGTTCCATGC CTGTTCCCTA GCTGCTGGTG GTTTGCTGGC 
383701 AATCTTTGGA AGAAGCATCA CCCTGATCTC TTGCCTTTAT TATCACGTGA 
383751 CATTCTTCCT GTAAGCCTCC CTCTGTTCCC AAATGTCCCC CTTATGAGGA 
383B01 CACTAGTCAT GTTGGATTAA GACCTAACTT AATGACCTCA TTTTAACTAA 
383851 TTGTATTAGT AACAACTCTA ATTCCAGGTG AAATTACATy CTAAGATACT 
383901 GCAGGTTGAA CTTCAAAATA TAAATTTTGG GGGGATATAA TTCAACCTAT 
383951 AATCGCAACC TATAAATTTG TCAACAAGTT CTAAGATAAA TTGGCAAAAA 
384001 ACATAATAGA TCTACGGTAG ATAACACAAA GGAAAACCAT GGACATTTAG 
384051 AACACTGCTA AGAACTATGA TAACCAGTAT CTGTACAAAT GTAATTGGCT 
384101 CTGGTGTTGA AGAATCAATT ATACAAATGT ATGTAATGCA CAAGTATATT 
384151 TTATCATAAC CTCTGAGAAT AAAATGAAAT ACCATATGTG ACAGCACTTT 
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384201 ACAAGCTGTG TGGTATTCAA ATTAAGGCAT TAGTTTTATT GCTATTACCA 
384251 TGTCTTTATT TATATAGGTA GCTTATTTTT ATTGTTACAT TGGGCATTTC 
384301 ACATTTTTAT GGTCCATTTC TTTTTATTTC AGTTACATCC TAGATTAATA 
384351 TCATCTATGT CTATTTCTTC TTAGGAAACT GAGTTTCTCC ATTCCTTTTT 
384401 TTCCTTGATC ATTAGAAAAC ATCTTTCCCA ACTCCCATTT AAATGCAAAA 
384451 TGGAGTCCTG GCACTGGGTT GCTCAAATAG CCAACTCCAT CTAAACTTAG 
384501 CTCACGACAG AAGGAACTTA AACTGTCAGC AAAATCAGCA CACTGACTGC 
384551 AAACCAGTTC TCATAGAGAT CAATTATAAA TTGAGACGGT GTTCTGGATG 
384601 CTTTAGTCAG ATTTGGCTGG TACCCCAATG ATCTCTGACT CTATTTCTTC 
384651 AGATACTAGT ATGAATTTTC ATCTTCAATT ATAACCTGCT ACCAGGATGG 
384701 CATATACCTG ATCCTCTTCT ATTTATTATT TATTTTGCAG AAAATAAAAA 
384751 TAGGAAAAAA ACCCTATGTT TGTACTTTTA CATTTAACTC AAAACTCAAA 
384801 TATTAGTGAA TTCCTTTGAT TTATGTGAAA ATTAAAGTTG GATCTAACAT 
384851 AGGCATCCAG CCTCCTTTTT TTGACCTTTC GTCATACTCT CTTTGCTACT 
384901 CTCAAATTTG GTCACCCCAC ACTGGAGACT GGTAATTTGC AAGTTAATTC 
384951 ATTCATCCAG AATTTATTGA GCACCCATCA TTTATCTTGT ACTCAGCTAG 
385001 CAGCTATAGA GGAGATAAGG CTAGTTGTAA GTATGGGATT TGGTGAATTT 
385051 AATGTTATTG TTTTGATTAA TGACTGGGCC CACTGGTTTC CGATATGCAG 
385101 TACTTTGCAA TTTTGAGGGG CACTGACTTG ACATATGCGC TGGGCTTTGA 
385151 GTGGGAGCCC TTTATAAGTG AGGGAGTGGA GTGAACTGCT CAGCGCCCAC 
385201 AGCTCAGTGG CTTGATTGTT TTCCACTTGC CACAGCCTTG GCTCTCCTCA 
385251 ATTTTTAATT TTCTGAATTT TATAGGGGGA AATAATGTAC TGTAATGGTA 
385301 ATCATGGAAA GCCTTAAAGT GTGTCTCCAA AAATACTGAA ACTCTCTTCT 
385351 CCTGTATAAA ATAAAAAAGG TTTGCAATCT AGATTCAGAG TTTAGAAATA 
385401 AGTACCTTGA CATTTTCAGC CCCTAAGAGA ACCTTTTACC CCCTAAACTT 
385451 TCTGTAGAAT TTATAACCTA TAACCAACAA AACACCAAGT TATTTCTCAA 
385501 GCAGCTATGC AGAAGCTCTT CTATTTGGAA AGGGAATTAA CAAAATCAGT 
385551 ATAAGACTTG GGAGCTTTCT GCAAGGAGTT TATAAATCTC GTCGGAGAGA 
385601 AGGGACAGGT TTCTTTTTAT. AGCATCAACA CAAGTCAGGA AAGCAGGAAA 
385651 TGATAAGCCA TGTTATTTGT TCCGGAGTCT TTAAACCCAG ACTCTACATC 
385701 TAATCACCAT CATAAAAATG CCTTTTTTTT ACCTCCAGCT TTATAGAGAG 
385751 GAGCCTCAGA GTGGGCCATC CTGGAGGAGT GCACCTATCA GTTGAGAAAT 
385801 AGGGATGACA AGGATGTAGG CAGCTGTTGC ATTCTAAAAT GTGAATTTGT 
385851 TTTGTGAAAG CCTTTCATTA ACAAAATTAT GTGCCGAGAA AGGGCAATGA 
385901 CAAAACTGAC CAAATTCCTT CCCTTGTAGG TCAGATAACA TTTAAGAGCT 
385951 TTACATGTCT AACAAACTTG AGGTTGCTCA AACTGGACAA AGGAGGAGAT 
386001 TAAAACACTT TCTGTGATGA CATAAAGCAG TGCTACGGGT AAGGCCTCAG 
386051 TCTCCTGCCA GCAGAAGCAA ATGTATTTTC TCCATTTAAA AGAGTTGCCT 
386101 TCTTAACCCA CACTTCCCCA GGCATTCGGT CCTGATAGGG CTTGGTTTGA 
386151 CTGATTAATT GTAATTCTGC TCCATACAAG GTAGACAGTG ATGAATTGAT 
386201 ACATTAAAGA TGTGATTAAT GGTTACTTTC TATTCAGTGT AATTATCAGG 
386251 TGATAGTTTC AATGAAAAAT GC-TGGAAAAT GAAACTTTTC ACTCTAGATA 
386301 ACTATCCACC CTGTCTGGAA AGAAAAATGG TTCTCTATTT TCAACATTTT 
386351 ACTCTAGATC AAGGCCAAGA GAGACAGAGA GACACAGCAC ACTGGCTTTG 
386401 TGAGTTTTAT AGAATGAGAC GGATTTGGAA TAATGCCTGT CTTTAAAGAA 
386451 ATTCTTAGAG AGGAAATCTC TGGATAATAA ACTTTAAATT TTATGCTAAT 
386501 CCTGGATTAC TTTGCCAGCT CCTGCCTGGT CTTTCTGGTC AATTCTGGCA 
386551 TTTAATCTAT CCTGAAGATG GTTTTGGATT AATTTCCCTA AAGCACCGCA 
386601 TTAATCATAT CACATCTGTT TTTTTTTTTT TTTTCGGAGA CCTCCAACAA 
386651 CTTCCCACTT AAATTATAGG AAGAACAAAA CATCAACATG ACCCTCAAGA 
386701 CCCTTTGATA AAAGGCAAAC ACTCTTTGTC AAACTGTTTC AATTCCTGTG 
386751 CGCTGTTCTT TCGAGGCTGT CAGCTCACTG CCATCTCAGC TGGCCATGAT 
386801 CCCCCGAGGC CATGCTTTGC TTTGCCTCTT TTTCTACATG TTTTTAGCTG 
386851 GTAGAAAAAA TTGTGGACGA CTTAAAAAGG GACCAAAAGA TTACCAGGTT 
386901 TCCTTAATGG AGAAATTTTG TGCTGTTTAG AATGCATCCT CTGATTCATC 
386951 TGATTCTGTA GGGTTTTGGC ATTTCATTGT TTTGGAACTA TTATATAATT 
387001 TTGTGGAGTT AGAAGTTAAC TTGCAGAAAA TTCATAGAAA TATAAAACAT 
387051 CATTTTCTTT TTTTCTTCTC TTTTTTTTTT TTTTTTTTTT TTTTTTTGAG 
387101 ACAGTCTCTC TCTGTCGCCC' AGGCTGGATG GAGTGCAGTG GCGCATCTTG 
387151 GCTCACTGCA AGCTCCACCT CC.TGGGTTCA CGCCATTCTC CTGCCTCAGC 
387201 CTCCCGAGTA GCTGGGACTA CAGGTGCCCA CCACCACGCC TGGCTAACTT 
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387251 TTTGTATTTT TAGTAGAGAT GGAGTGTCAC CGTGTTAGCC AGGATGGTCT 
387301 CGACCTCCTG ACCTGGTGAT CAGCCCACCT CGGCCTCCCA AAGTGCTGAG 
387351 ATTACAGGCA TGAGCCACCA CGCCCAGCCA AAATATAAAA CATTATTTTC 
387401 TATATACATG CAGATAGGTT CTGTCCAATG GAGATTCAAT GGACTTTTCT 
387451 AACCTGCTGT GGAAGAAGCC AGGTTTTTCT GTGTTAAGCA AATTCATGTT 
387501 GGCATTTTTG TTGGCCTTTT ATAGTTACTC TTTGGGATCT CCTTTGAACT 
387551 TAGGACATGT TAATTCCCAT CTTCCTAACT AATGAGCTAA ATAAAATCTT 
387601 TGACATGTAC ATTTTATATC TGTGTAAACT CATACTCATG ATTTACTTAT 
387651 TTTAAAAATG ATCCTTTAAT ATAATCTACC CAATTTAAAA CTTAGACAAG 
387701 TTTTTAGAGC AGTTTTAGGT TTACAGAAAA ACTGAATGGG GCACACAGGG 
387751 AGTATGCATA TATCTGCCAC ATCCTTTCCC AGTTTTCCTT GTTAACAACA 
387801 TACTGTACCA CACCAGAGAG GTACACTTAT TCCAATTGAT GAACCTGTAT 
387851 TGACACATCA TTATCGTCCA AACTACATAG TGTACTTCAG GTTTCACTCT 
387901 TGGTATTGTA CATTCTATGG GTTTGGATAA ATATATAGTG ACATGTATCC 
387951 ACCCTTCTGT CTCCTACAGA ATAGTTTCAC TGTCTTAAAA ATCCTCTGCA 
388001 CTCTGCCTAT TTATCCCTCC' TTACCCTCAA CCCCTGACAA CCACTGATCT 
388051 TTTTATGTCT CCATAGTTTT GCCTTTTCCA GAATGTCATA TAGTTGGAAT 
388101 CATATGATTG GCTTCTTTCA CTTAGAAATA TGCATTTCAG TTTCCTCCAT 
388151 GTCCTTTCAT GTATTGATAG CTCATTTCTT TACAGTGCTG AATAATATTC 
388201 CATTGTCTAG GTGCGCCATA ATTTATTCTT TCACCTACTG AAGGATATGT 
388251 TGGTTGCTTC CAAGTTTGGG AAATTATGAA TAAAGCTGCT ATAAATATCA 
388301 GTGTGCAGGT TTTTGAACAG ATATATGTTT TGAACTCATT TTGTAAATGC 
388351 CAAGGAACTG GATGGCTGAA TCATATGGTA AGAGTATGTT TAATTTTATA 
388401 AGAAACTGTC AAACTGTCTT TCAAAGTGGC TGTGCCATTC TGTATTACCA 
388451 TCACCAATGG ACAATAACTC CTGTTGTTCA ACATCTTCAC CAGAATTTGG 
388501 TGTTGTCAGT GATCTGGATT TTGGCCATTC TAGTGCGTAT GTAGTAGTAT 
388551 CTCATTTTGC TTTTATTTGT ATTTCGTTGA TGACGTATGA, TGTAGAGCCT 
388601 GTTTTCATAT GCTTATTTGC CATCTGGATG TCTCCTTTGA TGAAGTGTCT 
388651 ATTCAGATCT CATGCCTATT TTTTTAATCA GGTTGTTCAT TTTCTTACTG 
388701 GTGGACTTTG ATATCTAACC AATTAAAAAT TTGAACTCAA CTTGGCTGTA 
388751 CATTAGAATC ATCTATAGAG CTTTCAAATA ATACAGGTTT GAC TTGTGTA 
3 88801 CGTCCTTATT ACTCCAGTAA GTCACATATC ACTGGAGGGA TGGTGATGGC 
388851 ATACATAGCT AAGTCCCATG CAGAGAACTT CTGATTTGTT GGGGCCTGGG 
388901 AATCTGCATT TCTAACAAGT TCCCAGGTAA CAGGAAGCAC TTTGAGAACA 
388951 CTTGCTCTGC TTAGTATGGA TTACTAGTAG TACAGTTTGT TAGGCTCTCT 
389001 TCTTTCTCTA TATTAGCCTG TTTTAAATTT AATTGTATGT CCCATATTTT 
389051 GTTTACAATT AGCTATCTAA GTATATGTGC TTTTGTAATC ATAGCTTGGA 
389101 ATTGCGTTTT CTAGGTTAGT TTTCAAAATA ATGCTTATAT TCTGCTAAGT 
389151 CCCTGCCTTG TATTTTTGTT AACAGACTGA CAGAAATTGG TGCCTGGCCT 
389201 ATGGCCTAGT GGTCTTCAGA AATGTTCTTC CAAAAGCTTT TTAACTAAGA 
389251 AGACAAAACA CCATTTCTAC TCAACTTAGC TATTATCCTT TTAAATAAAA 
389301 CTTTTTGTAA AAATATCCCA AC AC AAAC AA • TAAGATTAAA CAAAGGATCA 
389351 TAGATTTCTC CCTAGGATTA TACTCTCTAT TAAGCACAGA ATTGAATATA 
389401 TGTATAATCA GATTTTAAAA TGAGTTTGGG TCTGCCTAAC CTAAAATGTA 
389451 GATGAGTCTG CCTCTTTCTA AAATTTTAAA TCATAGTTTC TTGCAGAGAA 
389501 ATCTGGGGAT GTTTAACTGA AACTTTGACT TTTTAAATTT GAATTTTCTG 
389551 AGGTTGCAGT TGATAATATT TAAGGTGAAT TTATTTGGCT ATAACTGAAT 
389601 CTTTGCACTA AAAGAGAGAA ATACCTGCTA TTTTAAATAA ATCCAAAACA 
389651 ACATATCTGC AAAGCTGAAT AAAGTCTATT TTCTCTCATA TAAGATAACG 
389701 TATTTCAAAG ATGATGAGAT ACGGAGTAAC CTCTTGATAG CATTTGGTTC 
389751 CCTCCTCATT GATGTTCCAC TGTTATGAAT CCTTGCTTAA TATAATGCTC 
389801 ATTCTCCTTC TAGGCTTTTC TTTTATCTCA CTATAAAATA TTTTTGAAGC 
389851 AAAAGTCCTT CTGGAGTGAA CACACACATG TATGTTAAGA CTTTAAGGTA 
389901 TTTACCTGTT AAGAAATATT TTTTTAAAAA AGAACAAACA GAAAGCAAAA 
389951 AGAGAATCAC GAAAGATTCA AATGAATATT TCTTATTCAT TTCGAAAAGC 
390001 ATGAAATTTC TATCTTAGCA TTCATAGATA GTATTCTGAG CATGCATTTT 
390051 ACCCTTTTGC GTGTTACTTT AATTTTGCTT TGAGACAAAA GCTGAAAAAG 
390101 AAGTATAGTA TATGTGTGCA TGTATGTGTA GTTTCTTCTC TTTTTGTAGT 
390151 TAGTGAAAAG AATTTACAGA TAATGGTATA TCAGTTAAAA CCAAGCTTAA 
390201 ATGTAGTGAT CACTGAGGGG CAAATTTCTG CTTTGCTTAG AGTTGTTAAC 
390251 AACCTCAGGT CACCTGTTAC ATACCTTTAT TTATTTGTTT ATTTATTTAT 
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390301 TATTATTATA CTTTAAGTTT TAGGGTACAT GTGCACAATG TGCAGGTTAG 
390351 TTACATATGT ATACATGTGC CATGTTGGTG TGCTGCACCC ATTAACTCGT 
390401 CATTTAGCAT TAGGTATATC TCCTAATGCT ATCCCTCCCC CCTCCCCCCA 
390451 CCCCACAACA GGCCCCAGAG TGTGATGTTC CCCTTCCTGT GTCCATGTGT 
390501 TCTCATTTTT CATTTCCCAC CTATGAGTGA GAACCTGCAG TGTTTGGTTA 
390551 TTTGCCTTTT TTTAAATTGA GAAATCAATT ATGAATATTT ACAACACTAG 
390601 GCAGCCAATG ACTCAGATAA ACTGCATTTG AACTTGTCGT GGCCTGACAA 
390651 ATGCAGGCTT TAGACTTTAT TGTTAGATAT TAAGAATGGT TCACTTTTAT 
390701 GAGTGAAAAA TGTGAACCAT CAGGAAACCA GAAATAGTCT AGGCTAGAAC 
390751 CTTAGCACCA CCATGAAGAC TTGAACATTA TGTGAAGGGG AAAGTTTGGA 
390801 TCTCAAAGTG GATTGAGCTA GAAGCCCATC AGGATCCAGG TTCAGCATCT 
390851 GGGCTGTGAA GAAAAATTAT, CAGACTTAGA ACATTAACTG AGAAAGCTTG 
390901 TGTTAGCTTT TTATTTTTGC ATAACAAATT ACCACACACT TAACTTTAAA 
390951 ACAATGAGCG TTTATTAGTT CATAGTGCCG CAGGTTAGAA TTTTATCACT 
391001 GCATAGCTGG GTTCTCTCTT CAGCGTATCA TGAGGCTGGA ATGAGAATAT 
391051 CATATGACCT ACACTCTCCT TTGGAGACTC TAGGGAAGAG ACCGCTTCCA 
391101 AGCTCTTTCA GATTGTTGAC AAAATCTAGT TTCTTGTGGT TGTAGGACTG 
391151 AGGTTGCCAG TTCCTTACTG ACTCTCAGCT TCTAGAGACT GCCTGCATTC 
391201 TATGCCATGT GCCTCCAGCG CATTTCCCTC TCGTTGACTC TCTCATTCTT 
391251 CTACTCGCTT TGATTGACTT ATCTGTGATT GCCAGATTCA GACTTGAAGG 
391301 GCTCATGTGA TTAGTCCAGC TCCACCCAGA TAGTCTCTCA TTTTTAAGGT 
391351 CAACTGACTT GGACTTTTAA TATCATTTGC AAATCCCTTT CCAGCGACAC 
391401 CTAGATAAGT TTAATTGAAT AGTTGGGAGT AGGCAGTACA TCAGGGGCCA 
391451 GGGACTCTGG AGCCATCTTT GAATTCTGCC TATCACTGAG CTACAAGTTC 
391501 AAACTCAACT GGGCAGCAAG CTGTCGAGCT GGCAGACACC CAGATCAAGG 
391551 AAGCATTGCT GCAGAGTGGA CATCCGATGG ACAGTGGAAT CTTCAATATT 
391601 TCAACAGCTA CACAGTGTCC CATTAGAAAT GATGGTGGCG ATTACCACCA 
391651 GAATTGGATT TTCTCTCAAC TTGATGTCCC TAGAAGGTCT GAATATTTTT 
391701 GGACGGAGTA TGACACCAAC ATGTGGGAGC AATTTCAGGT TGGTAAAGCC 
391751 TCAAGAAGGG ATAGAGAGTT TCCTGACAAA AGGACACATA GAAAATTGAG 
391801 AATGATGTGA CTTTGTATTA CAAATACACA GAGGAATTCA TAACATATAA 
391851 ACATTGTTTT ATGGCCCAGT ATGATGGCTC ACACCTGTAA TCCCAGCCCT 
391901 TTGGGAAGCT GAGGTGAGCA GATTGCTTGG GCTCAGGAGT TCGAGACGAG 
391951 CCTGGGCAAT GTGGTGAAAC TCCATCTCTA TAATAAATAC AAAATTAATT 
392001 AGCTGGGTAT CATGGTGAGC ACCTGTAGTC CCAGCTACTT AGTAGGCTGA 
392051 GATGGAGGAT CACTTGAGCC TGGGGAGGTC GAGGCTCCAG TGAGCTGGGA 
392101 TCACACCACT GCATTCCAGC CTGGGTAACA GAGAAAGACC CTGTCTCAAA 
392151 AACAAGCAAG CAAGCGAACA ACAACAACAA AATACCCACA TTGTTTTATT 
392201 AACTTGCTAT AAGCTCTTGC CCCACATCAA CCCTTGATAT TTGCCATATA 
392251 ATATGGGAGA CATATGATTA ACAGCAGAGT GGGGTTCAGC ATCCCTGCGT 
392301 CACGTCTTAC TCACTGCATG ATGTGGGCAA ATTAATCCTA AACCTTAGTT 
392351 TTCTTTTCTA TAAATTGGGA AATAAACTAG TACCAGCACA TAAATTATGA 
392401, ATAGTAAATA AAATACAACA TGTAAAAGGC TTACAGCAGA CTTTAGTTAA 
392451 GTACTTGATA AATAGTCACT ATGTTGAGAT TAAGTTGTCT GGTTCCTAAA 
392501 GTTACCAAGC CATTGCTATG GTTTGTATAC GGTTTGTTTG GCCCCACTGA 
392551 GTCTCAGGTT GAAACTGGAT TCCCAGTGTT GGAGGTGGGG CCTGGTGAGA 
392601 GGTGTTTGGG TCATGGGATG GATCCTTCAT GAATGGCTTG GAGTTCTCTT 
392651 CAAGGTAATC AGTGAGTTCT TGCTCTCTTA GTTCCCATGA GAGTTCCCAA 
392701 AGAACTGATT GTTAAAAAGA GCTTGGCATC TGCCCCATCT CTCTTTCTTC 
392751 TCTCTTGCCA TGTGATCTCC ACGTACTGGC TCCCCTTTGC CTTCCACCAT 
392801 GAGCAGAAGG AGACTGAAGC CTTCTCCAGA AGCAGATGCT GGTAACCATG 
392851 CTTCTTGTAT GGCCTGCAAA ACCATAAGTC AAACCTCTTT TCTTTATAAA 
392901 TTATACAGCC TAGGCCGGGT GCAGTGGCTT ATGCCTGTAA TCCCAGCACT 
392951 TTGGGAGGCC GAGGCGGGCA GATCACGAGG TCAGGAGTTC GAGCCCAGCC 
393001 TAGCCAACAT GGAGAAACCC CATCTCTACT AAAAATACAA AAAAAAAATT 
393051 AGCCGGGCAT GGTGGCGCGT GCCTGTAATC CCAGCTACTT GGGAGGCTGA 
393101 GGCAGGAAAA TCGCTTGAAC CCAGGAGGCG GAGGTTGCAG TGGGCCAAGA 
393151 TCGCGCCATT GCACTCCAGC CTGGGCAATA GGAGTGAAAC TCCATCTCAA 
393201 AAAAAAAAAA AAAAAAAATA CACACACACG CATGCACACA CACACAGACA 
393251 CACACAGCCT CAGTTATTCC TTTATAGCAA CACAAAATGG ACTAAGGTAG 
393301 CCATGTAGGT CTTTGATCAT TTTTAAACTG ATTTTTTTTT TCATAGAATA 
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393351 GGATCTACGT TGAGTCATTC ATTTACTGTC CATTTGTTCA TTCAGTTATC 
393401 TAAGTCATAT TTATCAAGCT GCAGACAAAG CAACTGAGAT GCCAATTTAA 
393451 ATGAGAAACA AATCCCTGGA GCTTGTGGTG CTTACAGCCT AGAGATTCTA 
393501 AAGGGGATCA TTCGCACTCT CACTCCACTT CATTCTCACT GAGATTCCAA 
393551 GAAAAACTTA AAGGGAACAT AATGGCAGTT TTAAAATAGA GCAATAAAGC 
393601 AATTTTCTCA CATTCCCTCA AACACTGTAA ACTCTTGCAA TTTAGAGACA 
393651 TAATTTTACA CATTTTTGAT TCTACCTAAA GTTAGCATGG TGGGAGCACA 
393701 TAGTACATTG ATTTAAACTA AAAATACTAG AAACAGGTAT TGTCTTTAGG 
393751 AGAATGAGCA ATAATATTTG TGAGGACCAG TGTCTAACGA TTAGGCATCA 
393801 TCTTAAACAA GAAACATATT GATTTATTAC ATGAAGGTTA CAGGCCTAAC 
393851 ACAAAAGTCA AAATGAAGCA ACATTTACTG CAATCTATGC TTTTAAGTAA 
393901 TGCCTCTCAA CAAGGTCCCA GGTGTTTTAG AAGCTGCGCT ATTTTAGCAC 
393951 AAAATAATAT ACTGCTGACT TTTCCTCATT GCTTTTGAAA CAATATGGAT 
394001 CTACTTTTGT CTCATTTACA GAACAAATAA AAAAGCCCTA GATTCAGTGA 
394051 GTCAGTAAAT AAGACCCATT TTTAAGAGAA AGCTGAGATC CTCAGAAAAT 
394101 AATTGGAGAG AATGACAGGG GTTTATATAC AAAGGCGTAA CTCATGACAT 
394151 CATTAGTATT TGTAGAAAAT TACAACTTAA CTACATCACT AGCAAAAAAG 
394201 AAGTGGTTAA GTAAATTACC AAATAGTCAC AAAAGTGCCA TATTAGGCAA 
394251 CATTTACGAA TGTTCATGAA GAAATTTTAT TGGCATAAGG AGAGGCTTAG 
394301 GGCAACGTTA AATGAAAATA AAAGCAGACT TTGGTCCTAT AGCATTGCAG 
394351 GATGACTATA GTTAACAATA ATGTATTATA TAATTTCAAA TAGCTAGAAA 
394401 GAAGATATTG AATATTCCCA ACACAAATAA ATGATAAATA TTTGAGGTGG 
394451 TGAGTATGCT ACTTACCCTG ATCTGGTCAC TATATGTTAT ATGTATGACA 
394501 TCACTATGTA TACCATAAAT ATGTACAATG TCATTTAAAA AATTTAAAAA 
394551 GAGTTGAAAA TACACATAAT ATTTGCAATT ATCATAATCA TATTTAATTT 
394601 TTTTCCTATA TAAAGGGGCT ACTGGTTATT CCATTTCAGC CGAACATCAC 
394651 TTTAGAAAAA CAGTTGGTGC AGTGGTTGAG TGCAAATTGA AGGATGTAGT 
394701 AAAGGTCTCC ATAACCATCT GGCTGAGAAC AGAAATGATT TAGTTACGGA 
394751 TTTTTGTTGT ATGTTACTGA TTGGCACTCA GCATCCAACC CCATCTCTTT 
394801 CAGTTGCGTT CTATAGAAGT AGGAAACCTA AAACTCATGT ATCTGATTCT 
394851 CTCTGGAAAC TATGGCACTA GATGTTAGAT TCTACAAGTT AGATATACAA 
394901 TTATGCTGTC CGGTAGAAAG GCAAAGGTTT TTTTGGGGGG GGGGGTAGGG 
394951 GACGTACTGT TTATAAAAAC TGGAAATAAA CAAGATTTTA TTGTAGCACC 
395001 TTACTAGTGA GCATCCTGTA CATTTTGTGA GTGTTGAAGG CAGGGGAAGT 
395051 TTCTTGAACC GGGCATTCCA CCTCAGTCTA AAACCAAGTG ATGGCTTGAA 
395101 GCAAAGGAAA AAGCCCCTCA AGTAGGTCAG TTTTATAAAG TTCTGGGAGT 
395151 AATTCCTGGA GGCCCAGCCT GTGGAGTGTT CCTCCTGTAC CCCAGTGAGG 
395201 CTGTAGGCAC TTGCTGCTCG GTGTTAACCC CCTTCCTGCT GAAAATATCA 
395251 CGTGGTTTCT CTTTTCTATC CTAAACCAAA TGAAGACAAA CAGATATAAC 
395301 GTGAATTCTG CTATTACAGG GAATATTGTA CACCTTTCTG TAATGTCATT 
395351 GATTATGTTT TTGTCATGCA TGTGTTTTGA GCTCCTTTTC CATAAAAAAG 
395401 TGCTCATGGT AAGCATATAC AAGAATACAC ATTAGTGGAT ACGAGGAGTG 
395451 AGTAGAAAGC ATGCTTGAGG GTGGTTATAA GTGTGTCTGC TACAGACTCT 
395501 AAAAACCATT TCAAATATTT TCTTAAGTCA GCACTAACTA TACATACCAA 
395551 GATAATACAT TGAATAATTA TCTTCTGGTT ACCAAATTTT TCATTCAAAA 
395601 ATTAATAAAG GAGAAATAGG TATATATTCT GGAACATTAT TACCTTAACC 
395651 AGATACTGTC TCTTTCTCTT TTCCCAAAAC TATGTATTGT TCTCTAGTTA 
395701 CCCTGGCCAA GATGAGAAAA ACTCCAACCT CTTGGTGGTT GCAGAGGCCC 
395751 TCCATGCCTA AATATTAATA GTTGGAAATA CAGTCTTTGT 'TTTTACTCAA 
395801 GCCAGCAAGC ATTTATCAAG TATATCTACT GGGGATAAAA TAGATGCTGG 
395851 AGATCTAATG ATGATTCAGA TATGACCCTA ATTCTTTAAA AAAAACTTTC 
395901 ACTATCTTGT GAAAGACACA TACCATGTAG TAGTGTACTG GTAAATGTTT 
395951 ATCAACCAGT TTTCTAAGAA AAAAATGTAT GTGCATATAT ATGTGCATAT 
396001 GTTTATTATA AATATTATTA TAAATAATTG CAGCACAATT TACAAACACA 
396051 AATAAAAATA GAATCCTCCT TATTGTAAGT TTTATATAAT GAATTGCTCC 
396101 TCAAAGTATG CTTTCATCAA TTTTACTGAA ATCTTATATC TGTAGCCAAC 
396151 TGATGGTTGC AATTGAGGAA TGAGTGTAGT • CTAAAAATGA ATGATAGTTT 
396201 ATGTTTTTGT GTATGTTAAC ACTATGCTAA CAAGTAAGAT GAAAGGGAAG 
396251 CAATTAATAT GAATCAGATC TTCACTCATT CATCAATGAC ATGAGCAATT 
396301 GCTTTTCTTA ATTGGGTAAT AATTTCTAAA TACTGCAAAG CATTTTCTGG 
396351 ATTTTTTGTG TTATTCACAA TGTAACAGCT AAAGGCACAA TACATGTTAA 
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396401 GTTTAATGTG AAGTGTTAAC ATTTTCTCAA TCACTTTCTT AAGTCTATAC 
396451 TAACAATATA ACAGTAATCA AGTTCTGCTT TGTAATGTTT GTCAGTTTCA 
396501 ATGGTATAAA TACTCCCATC ATGAACAATT GCAAGCTGCT AATGGAATGA 
1*111} TACTGAATAA GGAGTTGGAA AGAGAGGCAT AGTACAGTAC CTCATTATTT 
396601 TAGTATTCCC ACCACACAAA TAAAATAGAA ACAAGTAACT TCAAAGGCAT 
HJl} ATATAATAGT AAAATGTAAT AAGCTAATTA GGGAGTAAGG AGTTTTGAGC 
396701 ATTTATTAAA TTTATCTTTA ATACAATTTA ACTGTAATTT CATAGAGTTT 
396751 ACTTTTAAAT AATGGCTGTA TTTAACAATT AGCTTTTGAA ATTCACTAAA 
396801 ATGTAAGGAG TGGCTCTTGT GAGCCTATAA GGGCCATCTC TAGCACAACA 
1*1*1} CTGGATATGT GATTGGTGAT AAAGGTGGAA TGAAGCATGA GGAAAAAATA 
396901 GTCGTCTGTA TTTGGGGAAA ATCAGGAAAT AAATCCTGAA TAGGAGCCAG 
1^1} CCAGGCAACA CTGGGAGGTG TAGTGAGGGG ACATCCCATG CAGAGCAGGG 
397001 AGAAGGGTTA GTGGCAGAAG GTATCACAGC CCACTTAAAA CCTTCCCTCA 
397051 TCTCTTAGCA TTCGAATGGG GTAAGGTCTG CAGAAGCCAG ATTGACAATG 
397101 AACCTTCTAG GTCAAACTGA GGTGTGAGAC TTTCATCTTG TTTGAGACTA 
1*1*1} AAAGCCTTGC AGAAGTTGTA AAAGCAAGAT AATGCATTTG CATATGGAAG 
1*111 CAATGACTCT AGCAGTAGTA TGGAGCACAG ACAGTAGTAA AGCTGAGTCA 
397251 TGGATGAGGA GGCAGACTGG TCCTCAACGC TGATGTTGTC CTCATCATTG 
397301 TCTTTGGTCA GCATCTTTGT GCTTTCTCTT TACTGTCTTT TTCTAATTTG 
397351 ATCCTCCCAT TAGCAAAGGT CAAATCCCAA ATAAGTGAGA GTTTTTCCAC 
397401 CCAGCCATCC AGATTXAGAA GGCTCAGAAT GTACCCAACC TCTTCCTATT 
397451 TTTTTTTTTT TTTTTTGAGA TGGAGTCTTG CTCTGTCACC CAGGCTGGAG 
397501 AGCAGTGGTG CAATCTTGGC TCACTGCAAC CTCTGCCTTC CTGGTTCAAG 
lllll 1 ^TTCTCCT GCCTTAGCCT CCTGAGTACC TGGGATTACA GGTGTGTGCC 
397601 AACACGTCTG GCTAATTTTT GTATTTTTAG TAGAGACAGG GTTTCACCAT 
397651 GTTGGTCAGG CTGGTCTCGA ACTCCTGACC TCATGATCTG CCCACCTTGG 
397701 CCTCCCAAAG TGCTACAGGG GTGAGCCACT GCACCTGGCC AACCTCTTCC 
397751 TATTCTATGT TGACTTTATA CAGCTACATC CAAAGTGAAG TTGTTAACCA 
Hill} RAACATCArr ACAGTTACAA ATTTAGGAAG AAAAATCAAA TAATATTTGA 
397851 TGTTATTAGG AAAAGTGATA CTAGAGACTT GAAACATACT ACCCTCAATG 
397901 ACAATAGTGA TGCATGCAGA CAGTGTTGAG AAACTTAGAG AGTCGATGGA 
m„l AAATTTAGTT TATTTTAATT TAGGCTGTAT GTAGCTCAAA CATATGCTGA 
398001 GCTTTTATAA TTAAATGGAG TTTCTTTCGA AAGTGTGACA GCTCTTTGCT 
398051 TTCTTTCTAC AAGATTGAAA TTAGAGTTGG AGGACTTGTG ACTTCTGAGG 
398101 ACAGAGAAGA ATGTAAGCAG CAGAAAGTTT CTGGGAAACT TTCTAGATTA 
398151 TGATTTGAGG CGATTATTTC ACTGCCTATA CCCCAGACAC CTGCTACTTC 
398201 CTGTTTGATG GTAAACTTTT CCCTTCAGTT TTTATAAAGA GATATGGCAT 
398251 CTGGGACAAT TGCTTGGTCA CATGAATTTA TTTGAGGGAG GTCAGCATTT 
398301 CTAGACAACT TTGATATTTA GGTGATGGAA TGAAAGTCAA AAAAACCAGC 
398351 ATTGAGATTT TAATTTGAAA ATACTGAGTT AAGTCTTTAG TGTTAATAGA 
398401 GCTATGTTTC CCAGTTTTGG GATTTCATGA GGAATATGCT TGTAGAGCTG 
398451 AGGAATGATG CAGGCATTAT GAACAGAATT GAAACAAACC ATATAAAAAA 
398501 AGCTCTCATC AAAAGAAAGG AAAATAGCAA CAAAACAAAA TAAATAACTC 
Hill} TTCCCCCTGA AAAAAAAACC CTCAAAACGC TTGAAAACAT GTTCATGCTG 
398601 AAAAATAATC AAATACATTG AAATTAAAAC AACAATGAGA TATTCTTATA 
398651 TATATTCTAA TGAGTATATC AAATTGAAGG TTCACAAAAA TATATACTCT 
3987 01 CATCTACTAC TTTTATGGTG AACACTTTTA CCATATACCT TAAATGTTTA 
mi TGTGTCTTGA AGGTACACCA GTCTCTAGAA TTCATGTTCA TAACTGGTAT 
398801 ATAATACTGC CTCTTTCTGT AAACCGCTAT GTGTTGTTGG TCTTTTTGTG 
398851 ATTGGTTTAT TTAGAATTTC ATTTGTGTTC TACAACCTAT ATTAGATCTT 
398901 AATCTCTTCC AGATAATACA AAACAAAACA AAAAAATGCT TTTTCTCTCA 
398951 TGTTTCTTTT AGTAATGAGA CTTTTATCAA CAGTTAAACA TTTATTCACC 
111°°} CCAACTT CAT TCCCCAAACC ACTGGAGGCA GCCTACCGCA AAATCAAATA 
AGTTTAAAAG GACAGTAAAA GGACGAAAGT TTAGGAACAA GAAAAAAATG 
111}*} AATCATAAQA AGAAGGCTGC AGCATGAGAA GAAAATTAGA ACGGTAATAT 
m}l} AGAAACT( 3TA AGGAACTCTA CCAATGGAAC AGCAGAGCCA CGCAATTAAT 
399201 AGACTGGGCT TGGAAGTCAA AGGGAAGGAG CCTTGACCAG CTCCACTGCT 
399251 TTTATTCACT GAGTGCTAAG AGCAGCTTTG CATAGGATGC TCCACGAAGA 
399301 GCTGTGAAAT AAGATGTAGA CCTGAGGACT GCCATAGGGT CCCTTGAGGT 
Hem AAATQC »CT A ACGCGTGTCT CTGAAGCTGA TTTCCTCACT TTAGAACTTC 
399401 TAAATCCCCG CAAATCTCCC TCCAAATTTC ATTCTGTCTC TACTTTCTAA 
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399451 TATTCATCGC A6TCGGTCTT 
399501 TTTAATTGAC AAATAATAAT 
399551 TTGTGATAGA TGTAGGGTTC 
399601 AACGGGTCTA GCTTTCCAGC 
399651 GGGGGATTGG CGGGGAGAAG 
399701 CCTCTCCAAG CTCCCTTCCT 
399751 AAGAATCAGC CTAGGACACC 
399801 GCGGTTGTGA TCTTTCTCAC 
399851 TCAATGTGTC AGGCCTGCTG 
399901 ACTTGATTGG ATTGAAGGAT 
399951 GGGCGTTGTC AAAGGAGATT 
400001 GACCCACCCT TAACTGGGTG 
400051 AAAGCAGGCA GAAAAATGTG 
400101 TACATCTTTC TCCTGTGCTA 
400151 AAGTTTTTCA GTTTTGAGAC 
400201 GCAGACAGCC TATTGTGGGA 
400251 AACTCCCTTT TACATATATA 
400301 CATATATGTG AATACATATA 
400351 CATATGTAAA AGGGAGTTTA 
400401 ATACACACAC ATATATGTGT 
400451 TTATGTATAC ATATATACAT 
400501 ATATATATTT ATATGCACAT 
400551 AAGCAGGGGA TACAAATCTG 
400601 TGAAAGCCTT TGGTGGGGAT 
400651 TCAGCTGAAT TTTCTATAAA 
400701 CACCTGCTGA CTTAACCTGG 
400751 TTACCTTTCC TGTCTCTGTG 
400801 CCATTAGTCT GTCTGACTGT 
400851 TCTTCCTGCA TGTTCTCTCC 
400901 TATGCTTTCT TGTAAAAAAA 
400951 ATACATCTGG TCATAGGGGA 
401001 CCCTGACATG TATCAAATTT 
401051 CAGTTTGAAT TCACCACTCT 
401101 AACGTACACA CAGACCACAT 
401151 GCAGAAATTA AAACAACAAC 
401201 GAGTACATCG CCTACCCACC 
401251 TCAGCAGAAA ACTGACCAAA 
401301 GGTTGTCTCT GAAATATGCA 
401351 TTGGGAAGGT CCATGCCTAT 
401401 GAAAAGTAGT CTCGCTGTTA 
401451 TTAATGACAC ATAGTTTTCA 
401501 GTAAAAAGCC AACCCATATA 
401551 CATTCATACA GTGACCCTTC 
401601 AGTGGGGCAT ATCTCCCAGA 
401651 GCAGAGACCA TTGTGGCCTA 
401701 GCCAAGTATA CTACTTCAGG 
401751 TCAACACAAT CCTGGATCAT 
401801 AGGAGATGTA GTCAGTGATT 
401851 CTATTAGCAG GAAATCAATT 
401901 GGTATTTTGA TGCCTTTTTA 
401951 TTCAGAAAAA ATCTCCTGAG 
402001 CACATTTAAA AATTACTTGT 
402051 TTTTTTAGCT CTTGCTTTCT 
402101 AATTGCTTCT GCTCA'SPTGGG 
402151 GTATTACATA TTTGTCTCTC 
402201 CCTCTCGTCC CTATATATAT 
402251 CTGATGTCTA AAGGAAATGG 
402301 CCAGGTTTCC TTTATTCTAG 
402351 TGTAATAACC AGAAGTAACA 
402401 AAGATAAATG CATAGATGTC 
402451 AACTATATTT TTCCATTTTT 



TTAAAAATAT TTTTTAAATT TTATTTTATT 
TGTATCTATT TATGGGGATA CAATGTGATG 
ACATCCATTG TCTTTTGCTG TAGCCTAGCA 
ACCAGAGCCA GAGGAAGCCT TGCTTTGTGT 
TGAAGAAAGC AATGGGGATG CATTTGGTCT 
TCCTTACTTT ATAGCATGCC CACTTGCAAC 
CAAAGTGCCT TTGCCTCACT GCTGTTTGCT 
TGGAGTTGTG AAAAAGCAGA AAGTGAGTCC 
AGCTACTGTG ATGGTTAATA CTGAGTGTCA 
ACAAAGTATT GATCCTGGGT GTGTCTATGA 
AACATTTGAG TCAGTGGGCT GGGAAAGGCA 
GGCACCATCT AATCAGCTGC CAAGGAATAT 
AAAAGGGGAG ACTGTTCTAG CCTCCCAGCA 
GAGAATTCCT GCCCTAGAAC ATCAGACTCC 
TCGGACTGGC TCTCCTTGCT CCTTAAGCTT 
TCTTGTGATC ATGTAAGTTA ATACTTAATA 
TATATATGTG TATATATAAA CTCCCTTTTA 
TGTAAAAGGG AGCTTATATG TGAACACATA 
TTATATATTA TATATACATA TATATTATAT 
AAAAGGAAGT ATATATTATA TATACATATA 
ATATATCTCA TATATATACA TATATATCTC 
ATATATATCC TCTTGGGGAG AGGTCAAAAG 
ACTGCAAAAA GATTGAAGGA CAGTCTTTAT 
ACAGAGGAGT GAATTATATT GAGAGATAAC 
TTCATTGTTG CTATCTTCCA TTTTCCTAAC 
GTTCTTTCTC CATTTGCTCT CTGACTCTGC 
GTGCTTCTCG GTCTTTCTCT CATGTTCCTT 
CTCTCTCTGT CTCCCCCAGC CCCCACTGGA 
TTCCAGATTA TTCTCTCTTG GTGTTTGTTC 
CATGTTGGCC TCTCTGGGCT TTTGTAATTA 
AAAGAATGCT TCTAAATGTT ATTTGTTTTT 
TAGTTTTCAC TCTACCTAAA GTCCCGTCTC 
TGCTGGTGGT TAGATAAGCA AGCAACAGAC 
ATTGAGAAAC AAGGGAGCCT ATTTGAAATT 
AAAAAAAAGA TGCATAAGGA GCTTTCTGAG 
ACCCCACTTC AACTGGAAAT CTGTTTCCAG 
GACCCATCAT CAATTATACT TTGTGAGGAT 
GAGTGGCCCA GAAATTGTAG AAAAGAAGGG 
AAAGCCAACA AGACACTTTC TGCACCTCAA 
TTGTATTCTG TCTGATCAAT TTTCCTTATT 
TGTTTTGTAA TATTTTTGTC ATCACACATG 
GTCTGAGGTG GAGGATGGGA AAGCATAAGA 
AAACAGTCAC AGATGACATG GTTCCTGGAC 
TGAATTTCAA TGAAAGCAAT TCTATAGAGA 
AACAAAGAGC TTTTCAGTGA CCCAGGAAGA 
CACCTGGAGC AATGGATGAC CTGAATGTCC 
TATCTTTACT TTCAACCAGA TTTTGATAAG 
TGTAATCCCA CAGTGGTTTG CCAATGGTTG 
CAGAGATGTC ATTTGAGGAT GAAGCTCCTC 
GGAAGCTAAG AGGAGCTCCA CTACCAGCTC 
TTGCCTAATT TCACCTGAAT TTTTGGAGCA 
GAATCAAATT CAATTTGAAA ATATGCAACT 
CCCTGAGCTA TTTCTTGAAA ATTGTGGAGG 
CCCATTTACA TGAATTATTT ATTTTTATAT 
CTCCCTCCTA CAAGCTGTTC CATAGCAAGA 
TCATGGTGAC ATTACAGCCA TCTTCACCCT 
AAATTCTGAG GGTCTTCGAA AAGCAAAAAG 
ATTTCTAGAG GATTTCATAG CCACTAAATC 
TACATGCATG TACCTGGGAA GTTCAGTGAT 
TTTACCCATG AGTATATAAT GCATCATTTT 
CGGTGTTCAC TTTGTAGTCA TACAAATACT 
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402501 AGCCATTGCT CATTGAGTAC TGTATGTCAG GAACTATGCT ATGCATTTGT 
402551 GTAAATCATC ACATCTCTTT CCAGTTATCC TGCATAGTAT TGTTACTCCA 
402601 TTTTTTTTTC CTAATGATAA AACTGAGGCA GAGAGAAGTA AAGTAACTTT 
402651 CCCAAAGTTA CAAAATCAGT AAGTCATGAT TTGGTACAAA TATGCCAAAC 
402701 TACCAATCTG GGCCTTTTAA CTCCACTCTG CTGTTGTTAC ATTATCCTTC 
402751 TCTTGCACAT TTAAGCCATA TTATTTCAAT GAAATTCTGC AAATACTTAT 
402801 TGAGCACTTG TTATATGTAG CCATTGTCCT AGTTACTGGG GTCACAGAAT 
402851 GAAAAAGTGG AGCCAAATTA TAAGGAGGTT AGATTCTAGT AGACAAGTCA 
402901 GAAAGATGAA CAGATGAATG TCCTACAGGG CAGTACAGGT ATAATAAGTG 
402951 TTTACCACTT AACTAAAGGT GGCTCAGATC AGAGAAAGAC TTAGTACATT 
403001 ATGGTGCTTC AGGATGCTCA CTGGTGGCCA GGCAGAAAAG AGTAAAGACA 
403 051 ATGGCATTCC AGCAAAAGAG AGAGGCATGG GCAGTGTCAC CTACTGGTGA 
403101 AATAATGCAG ACAAAAAACC AAGAAGAGTA ATTTGGTACT GTTGGGCCAT 
403151 AACAGATACA TAGGGGAATG GTGGGAATAA GGATAGAAAG TGACTGGGGA 
403201 CAGACAATAA ACATAATTCA GTCTTACAAG TTTGGATTTT TCCATGTAAG 
403251 TTACAAAGAC CCATTCAAGG AATTTTAAGC AGAGAAATAA TCAGAACAAG 
403301 TCATGCTCAG AGTAAATATA GGACATCAAA TTGTTTATCA AGAAAAACAG 
403351 TGTTTTTTAT TATACTTTGT GTTTTACTGG CAATGACATA TTCAGTAGAT 
403401 CTCTTTTTAT CTCAGTTGAT ATTAGACATT AGAAATGAAG GCTGCGGCTA 
403451 TATTTTAAAA TGGTAAAGAT CTTCCATCTG TCTAGATAAA ATTTTGTTCA 
403501 ACGTTTCAGT CACTTAGGTG ATGGAAATTT GGCTCTCCTC CATGAATCCC 
.403551 ACTTTCCTTG ACTCCTTCCT ACTACAATGT CAATTCTCCT GCCCTTTCAA 
403601 GGTAGAGAAA AGTGTTTGTA GTCTCAAAGA TACCAGATGG TATCTTTGCC 
403651 TCCTTTTGAA ATTCATGGCA CTTATGTTAT TCCACTTTTC CACTCAATGT 
403701 CTTTGTCACT TGCCTATATC CTAGTTGCTA CCAGTGTCAC AGAAAACTCT 
403751 GAAACTTGAT CCTTGATTCT CCTTTCCATT TCAACTCCCA CCATCATTTT 
403801 TGACCACACC AATATCTTTG TATGCAGTAC AATAATCTGG GAGTTTGTAG 
403851 AAGGAAAATG GTACTTTTGA CTTTGGGGAT TAAGAGAAGT TTTAAGGGAG 
403901 ACATGTTCTG TTGCTCTGCT CTTCCAAAGG TAAATGAAAC TGGAATTATT 
403951 AAAAATTGGG AGCAGAGGCC ACATTTCCTA GTTGGAAGAA ACAGCACAAG 
404001 CAATGGCATA ATATGAAAAG GGTGGCAGGA AAGCACATAA TCCTGTGTGA 
404051 TTGCATCATA GAGGATATAA AAAGGGGCAA TATGATACAG GGCTAAAGTG 
404101 GCTAACTTGT TCTACATTTT AGAGAAACTT GTTTTAATAT CAAGAGAAGG 
404151 AGACTGAATT TCATTTGGCT GGCAGTGGTG GCAGGGAGGT GGAGATTTTT 
404201 GAGAAATGTA ATGATCAAAC TGTGCCTTAA GAAGGTGAAT GAATAATTTT 
404251 TTAGGTTGGG AAAGAGGATG GAGAGAGGGA GAAGATAGAG TGCAAACTGC 
404301 TGCTAACTAT TTTGCTTGTT GTCCAAAGTG TGTCTAAGAT CATCTAGGCA 
404351 GAGGTAGCAG GAAGTAGAAG GAGCACAGTG TCTATAACTG CGTTTCAAAG 
404401 AATGCCAACA CTTAATAACT GAATAAGCTT GGGTTGGTAT TCAAGTGAAC 
404451 TTGAGGAGGA GTGCCTGACA GGAAGGAGGG AAACCAGGG4 AATACTTGGC 
404501 AGTGGCAAGT AGAGAGAATA TGCTCTAGGT CAGAAGCACC AATAGTGTTA 
404551 AAATGAAAAC AAGAGGTCAA ATAAGATGTG AACTGGACAG TAAATATTAG 
404601 TTATAATAAC ATGGAGATCA TTGATGGTCT TAGCTGGTTA TTAGATCATT 
404651 TGTTAGCTTC ATTGGGGGCA GAATCTGGGA GAATTTGGTA GAAGGAAGTG 
404701 CTGCAAAGTG GGAGACTCTT GAGGGAAATT TTTAAATAAA CACTGTTGGT 
404751 ATAATTAGGA GGGGCAGATA TACATTCAAA TCCTGCTTCT CAATTAATTT 
404801 TATGTGTGGC CTGACTTCTC GAAATCTTAG TTTCCTCCCG GTGGATAAGA 
404851 GAATGGAAGC CAATGGGTCA TTATGTAGAA TAAATGAGAT CATAAGTCAA 
404901 TAATTGGATG AAAAGAGTTC TGCGTTTGCT TGACCTAAGT ATTCAATAAT 
404951 GGTGGCAATT AC TATATTAT TTGAATCTGG ACAATTGTTA CATATTGGAG 
405001 GAGGAGGTTA CAGGTGTATA CATAACTTGT TAGTTATGTT TTGACTGAGA 
405051 ATAAAACTGT TATCAAGAAA AACGAGGAAG TCAAAAAGAT AATCTAAAGA 
405101 AGAGAATGAA GTAGGTTTGT AATTTTTTCC CTTTTAAAAA TATGCATGCA 
405151 ATGAAATGTG CAGATATTTT ACATTTCTGT GGGTTTTGAC AAAAGCTTAT 
405201 ACACATTTAA TTCACAACCT TTGAGGCTAT ATTCTATTTC TATTACTTTA 
405251 AAAAGTGCCT TCATTTTCCT TTGCAGGTCA CTCATTCTAC CACCACCTGT 
405301 GGGGGCAACT ACTGTTTATT TCTATAACCA TAGATTAGTT TTTTCTACTC 
405351 TAGGTTTTTT GGAGTGATAT AGTACATACT TGTTTGTTTC TGGCTTGTTT 
405401 GAATATACTA CTTTTTAAAT TTATCCCTGT GGTTTGGGGC ACTGATCGTT 
405451 TGCCCTTTTT TTCATTGCTG GGTTTAAATA TACTAAGTAT ACCATATTTT 
405501 TTTTGTTTGT TTTAAAATCT ATCCACTTGT TTATGGACAT TTTTGTTGTT 
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405551 TCTAGTTTTG CGCTATTTTG 
405601 TAAGTTTCAC TGTATAAGAA 
405651 TCCTTATGTA TTATTTTGTT 
405701 GTTTAGAGCG GGTATTCATA 
405751 CAGTGTAGTT CATGATTTTT 
405801 AATATTTGAA GTATTTTTCT 
405851 TAAATTTGTC TTTTGTTTTT 
405901 ATTTTTTTAA TGTGTATTCT 
405951 TCAATTTTTT TTTCCCAGAT 
406001 TTTGGATATT TATTTCTCTG 
406051 AGTTACACAG ATGTTAAACC 
406101 GTTAATGTTT AAAAACTTTT 
406151 ATTTCTGTGT ATTGAAGCTT 
406201 TCTGTTAAGC CCATAAGATA 
406251 TTCAAGATTT TCCATTTGGT 
406301 GAATTTCTAT TCGTTCACTC 
406351 GAACATATTT ATAACAGCTG 
406401 TATGTGATTT CTCTGGGCTT 
406451 ATGAAAGAAA TCAAGGTACT 
406501 TGCTGTGATT TGACTTGACA 
406551 ATCATTAATT TATTCTAATT 
406601 TAGCCTAAGT TGGTGTAGAA 
406651 GAAAAATACG ATTACACATG 
406701 GGATTTGAAC TGTTTTATCT 
406751 GTAGTGCTGC ATTGAGTAAA 
406801 GTTTAGAAGT TAAGAAAGAG 
406851 GTTGATGCCA AAGACAGACA 
406901 CTCTCCCAAC TTACCTTAGA 
406951 CCTATGTAAA GCCTGTTAGA 
407001 GAGAGCAAGG TGATTTTGTA 
407051 TGATACATGC TTGGCACTCT 
407101 GGTTCTTCAT CGAGTTTAGA 
407151 ATTTTACAAT TGTGTAGGTG 
407201 TTTCTGATTC TAGACCAAAG 
407251 TATTTCTTAT TCCATAATCT 
407301 AGAATGGAGA TAGAGTTATA 
407351 ACAGTTGTGT TTTTTTCTTC 
407401 ATAAGTATAT GGTGTTTCAC 
407451 TTTTAATGAT AGTATAAGCA 
407501 CTCTTGAAAT AAGTATGCTG 
407551 TTGAAACTTC GTGAAGCTTG 
407601 AAATACAGAC CTCGTTTCAC 
407651 GGAAAATTGA TGAGGGCATT 
407701 AATAAATGGT GTCATTACTT 
407751 TAAAATTAAG GGTTATCTTG 
407801 TTCAAGGCTT TAATAAATGA 
407851 AGCTTTAAAA GTAGATGACT 
407901 GGAAACAGAA TGGGCTATAT 
407951 TGGTTTGAAT TTATTAATAA 
408001 AGTATATTTT CATAGGAAAA 
408051 CACATAGAGC CATGAAAGTT 
408101 AGGTTCAGTA CACCTAATGT 
408151 TTTCTAGGAA GTTAGAGGTA 
408201 TACTTGTTAA TGTGTGTGTG 
408251 GCATTTATTA CATTATAAAA 
408301 TTCCAATATT GGAATATAAA 
408351 CATCACTATT ATTATCATC A 
408401 TCTGGGCCAG AGAGATTCCA 
408451 CTCAACCTAT ATCTGTGAAT 
408501 GGCTGATCTG AACAAAATCA 
408551 CCAGAAGGCA AGGGCAACTT 



AATAAAATTT CTGTGAACGT ACACCTTGTA 
ATCATCACAT GCCCAAGCCA TGTAGATAGT 
TTCCCAATAA ACTTTATAGT CAGCTTTTAT 
TAGATTTAAT TATTGTTTGT GGCTAGAATT 
TTCTATATTA ATATTAACAT GTTTCAGTGT 
TTCTCCATTG AATTGCTTCA AATATTTTTA 
AGAAATATGA CTATTATGTA TCTAGGGTCT 
GTTTGGGATT CATTGAGCTT CCTCTAGGGA 
CAGATTTGGA AAGTTTTAAG CCACCATATC 
CCAATCTCTT TGTCTTCCTT CTGAGATTCC 
ACGAAGTGAC CCAGGGGTCA CTTAGATTCT 
TTCCCTCTCT GTACTGATTT TATAATTTTT 
ACTGATTGTT TCTTCTGTAA TATCCCATTT 
ATTTTACATT TCAGATATTA TACTTTTTAA 
TCATTTTTAT GGATTCCATT CATTTGATGA 
ATTATAATAT CCATCCTTTC CAAAATCATT 
GTTTAAAGTC CTTCTCTATT AATTCAAATA 
GTTTCCAATG GCAATTTTTT TCTCTTGGTT 
TTATTATCAT TGGAACCATG ATATAAATAT 
CAGCTACAAG CAATTTTGAT ATGCCTTGGC 
TAGTGTTTAA TGTCCCAGTA ATAATATTTC 
TAGAAACCTC AGCTCTTATG CATTGGAATA 
ATCATGAAAA GCAATGATGC CTGAATATTT 
CACTTAAACT GAGAGTGATA GGTTAAAAGA 
GGCTATTTTG ACAGAATTTG TTGAACCTGT 
AGGAAGAAGA AACAGGAACA CTTTAAAAGA 
CATAAACAAA GGTGAGATGC TTTTCTATTT 
AGGCTGTGCT GTAGCAAACG ACTCACATTT 
AACTTTTTCA GACTATTGAA TCGCCACAAA 
TAACATCTTA TTCAGTATTT ACTAGATAGC 
AATGACGTTG TAATGGGTGG GCACATCCAT 
GACTGGTTAG TGTATTATTT TATGCTGTTA 
CCACCAAGTC CTCTAATGAA CAACCTTTGC 
ATCTTCTCAC ATATATCTAT TTCTTACATC 
CTCAATCTCA TGTTCTTTCT GTAAGGAATG 
TAGAGATCAT AGAATCTTGG ATGTGGCTTT 
TGTTTTAAAG GGATTGGGGA GGGAGGTCTT 
AGATACAACA CTCTGGGATT CCATTTCAAG 
GTACATTGAA AGACAGCCAA TAACATTCAT 
CAGGGCTGAA ATAACTTTGG CTTTATTGCT 
GGTTAAGCTC TTGGCTATGT TACATCTGAG 
CTGTTCCACA CTACAGCTGC AGTTTTATTT 
AAGTTCTGAA GTCATGTTGA TTCTAATGTA 
CATAGTTGGG AGTGCCACTA ATGCAGACCT 
TTGATTGGGG GAAAATTAAA GTAATCAACT 
GATTGCACAC TTAAAAATCA TTTTCAAACC 
AGAAAATCAA TAGCAATAAT ACATACCTAT 
TTATTTTTAA GGATAGTTTC TTGTGTTATT 
GGGGCTTAAA AGTAATCATA TAAAATATTA 
TACCAATGGT ATTTTTTGTG GCAGTTAATA 
GAATTTTCCA AAAGAAAAAA ATGATGGTAC 
AAATTCATGA AATCAAAGTA TATAAAGGTT 
CAGAGGGCTC AATTCTATTC CTGTATGTAA 
TGTGTGTGTG CATGTGTGAA TTTGTGTGTA 
TATAGTATAT AACCTTAGAA ACTAAGCTCT 
TTGCTTTAAA TATTTGGACA CAATCACTAC 
TCATCCCAGC TCATATTTAT GTAGTCTGAG 
TGCTAAGTCC TATTTAGTAT GAAGTGGGCT 
TAATCTTTGT GTCACCACAA GTGTCCAGTA 
GTACTCAGGA TATGCTTTGA ACTGTTAACA 
TCACTTTTAC ACACATAAGG GAGAGTGTAT 
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408601 TCTTCAATTT CATGAGTAGA GAGTGGTGTA CGTGAGAATA ATTTATAAAG 
408651 TGATGTCATA AGTCCCTATG TGCCTGGGAA ACACAAACCT GCCTACTCCA 
408701 GACAACAACT GGGTGGTTTT TATTATTGTT GTTTGCTTTT ACGTTTGAAG 
408751 CAGACTTGTG TGGCACTTCA GAAACAATAT GAGATATTTA GCAGTGTATA 
408801 AATTTGAGTT TGGAGTTGAT TCAGGTCGAA CTTTCAACAG TAATCTGATG 
408851 TATATGTAAT GGGATACATA ATTAGTAGTG TGCTGTATTG TAAACATGGC 
408901 GGAAAAAATG TCAGAAGACT ACATGCTAGA CAGTCCTCAT TGTCTGTCTT 
408951 GCCATCCCCA GTTCCTGGGG CCCTGGACTA AGCAGGCACA ATGGGTCTGA 
409001 GCCAGAAAAG TCCCCATTGT AAGTATAATC ATAGTCATTG TAATAATAAA 
409051 AATGATTTCC ACTTGTTTCA ATTTACTACA TGCCAGACTC TATATTACAT 
409101 ACTTAACATG CATATCTCAT TTTAATAACA ACACTGAGAG ATAAATAGGT 
409151 TCAATTTTCA TACGAAAAAA ATGGAGGCTG AAAACTTGGA AGTGAGAATG 
409201 ATATGATATA ATGAAAATAA ATGAAATTTG ACATCAGATA GGCAGCATTG 
409251 CAACTCATGA TTCACCATCT ATTATGTGAT CATGGGAAAA TTACTCAACT 
4093 01 TGTCTGCGTT TTAGGTTCTT AAACCATAAC GTCGGGATTA CTTTTGTTCT 
409351 CCTATTCTGA GGATGCATTC AGATATGTAT GTAAAACTCC AAACATAGTA 
409401 CCTAAGACAT AGGAGGGCAT TTGCAAATGT TGGTGGTCCT GACCTCCCAC 
409451 CATTCCTTAT TGACTCTGCA GAGTTAATGT TGACCTGGGA AGTTTTCACC 
409501 CTGGCTCCAG TAAGGAAAGT GAAAATGACT AATGATTTTC TTTATCCAGT 
409551 AAAGGATGTG GAGGATGGGG CGGTAAACTT GGGTACTTTT CTACATCCTC 
409601 CTCTTTTCCT GGCAAAGTCC CAGTAAATAC TTATGATAGC TTTGTGTAAA 
409651 AACAGCACCT GTAGCCAGAC ATGGCACCAT GGAAGGCAGG TTGGTGGGCA 
409701 AATGCCCTCT AAGTGTAAAA AAGAAGGAGT GTTGTGGGTC CCCAGATCAC 
409751 CCACCAGATT TGATGATTTG CTAGAAAGGC TCACAGGACT CATCTTGTAG 
409801 TCATATATGT AGCTACAATT TGTTACAGCA AAAATATAGA AAGCAGAATT 
409851 AACAAAGGGA AAGGTACATG GGGCAAAGTT TGGAGGAAAC TGGGTGTAAG 
409901 CTTCCTAGAG TCATCCTCCG GTGGGGTCAC ACAGGATTGC TTAATTTCTT 
409951 CAGCAAGGAG TTGTGACAAC ACATAAATGC CTTCCAAGGA GTCTAATTGG 
410001 AGTCTCAGTG CCTAGAGTTT CTATAAGGGG CTGCTCACCT AGGCAACTTC 
410051 TCCATGTACC AAATTCCAGA CATCCTGAAG TGTTCAGCAT AGACCATATG 
410101 GTTTGCATAA ACAGGCAAGG CACGAGGAGC CATTCTTATC AGGGAATGGT 
410151 GGGAACTCTC TTCAAATCCC ACTTTCCAGA CCTAGCCAAG GACCAACTTT 
410201 GCAAGCAGAC CTTTATATGG ATAGCAGTCT CAGGCCTTCT CAGTTTATAT 
410251 TAACTCATAT TAACATAGTC TTAGGGTGGG TTACCTCTTA CCTGCACAGA 
410301 AAGACATAGT TGCGTGCCAG GACACAAGCA TAGCAGAGCA GGAAACAGAT 
410351 GGAGTTTTGC CCTGACCCAT CTGCGTGTAC AGATACCTTT AGGCAATGAA 
410401 CAATCTACCA AACCATGGAT GGCTTCCCTG CCTACTAGAG AAGCCTCACA 
410451 TGGAAGAGCA CGCTGTGTAT GACCTCATTC TGCCTTCACT CTCTGGTCCT 
410501 TTAAATGTTG ATTGATCAGC CTTTTTTCAC TGACCCAATT TGCCTTCTCT 
410551 GAAGATGCTT TTTGCAAATA CAGCAGGTTT GATGAATGCT TAGCAAGAAG 
410601 TTTGGATGAC TTCTGCACTG GTTTCTGTCT ATTTGGGGCT GATTGGAAAA 
410651 CTTAAGACAC TGAATTTATC TTATACTTGA AATTCTGTAA CCACTGTTAT 
410701 TTATGTCTCT GTGTCTGTCT CTTTCTTTAA TCTTTGTTTT GTTTTTCTTC 
410751 AGTAAGCAGT TCTGTGAGGT AGTTAGGACT ATGATTGGGT AGTTGAGGCT 
410801 CCATGTGGTT AAGTAATCAT ATCAGAAGAA TTTGTTGGAA AGACTCAGAC 
410851 CAGAATACAG ATTTGCTGTT TCTTGAATGA GCATCTCTTC TGTTGAATCA 
410901 AATTAATACT ATAAAAGTAA TTTCATGGAA AGAATAATCA TAAAGAGGCA 
410951 ATATCAGTCC AAATACAGAA AGGCCTTGCT TTTCTGACTT TCTGTAGTAG 
411001 TAGCAAACCC AGTGGAGTTC TGAAGAATGT GGTCATTGCT ATGGATTGAT 
411051 GTTAGAGACC AGAGCTAGAT TAGTCAGACT GGTCAACTCT TCTAGAGTGA 
411101 CTGGAAACTT AAGCATCTAA GAACAAGGGC AAATCAGTAC AAATACAGTT 
411151 AACCCTTGAA CAATGCAGGT GTGAACTGCA TGGGTCCACC TATACGTGGA 
411201 TTTTCTTCTG TTTCTACCAT CCCTGAAATA GCAAGAGCAA CCCCTCCTCT 
411251 TCCTCCTCCT CTTCAGCCTA CCTGACATGA AGATGACAAG AATGAAAACC 
411301 TTCATGTTGT TCCACTTTCA CTTAACAAAT AGTAAATATG TTTTACTTCC 
411351 TTATGATTTT CTTAATAACA CATTTCTTTA GCTTGTTTTA TTGTAAGAAT 
411401 ATAGTATGTA ATACATATAA CACACAAATA TATGTCAATA GACTGTTTAT 
411451 GTTATTGATA AGCTTTCCAG TCAGGGATAG GCTACTCGTA GTTAAGTTTT 
411501 TGGGGAGTCA AAAGTTGTAC TCAGATTTTG AATGCACAGG GTTTTAGTGC 
411551 CCCTAACCCC GATATTATTC AAGGGATAAC TGTAATCAAG ACCTGACAAG 
411601 TCCTGTCTAG TCATCAAATC AAAAGTTCAT GAGTGAGGAG AGATGTAGGA 
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411651 GAAGAACTTA AGGATGACAG 
411701 TAAGGAACAG ACTTCAAAGT 
411751 ACTGGGCTTG GGTGTTCACT 
411801 CATATTATCT CCCTCTTATT 
411851 TTGCCTGTTT GGAAGCCACT 
411901 TATTTCATTC AGTATCTCCC 
411951 GTTAGTTTTA TAAAGACATC 
412001 AGCTTGCAGT TCTTTACCTT 

412 051 CTCACTTGAA ATCTGAAACA 
412101 CTTTAAATAA TGAAACAGGA 
412151 TATTTTATGC CTGCATTTTA 
412201 CTTGGCAACA TTGCAAAGTT 
412251 TCCTGCTACA TGGAAATTTT 
412301 TTGTATTTCA GTGTCTGCCT 
412351 TGTCTTATGT ACCAGATCTA 
412401 AAGTTCTCAA TGTATTCCTT 
412451 ATATATTCGG ACCAGTCTAG 
412501 GGGGAGAAGA CAAATGCAAT 
412551 ACTTAATCAG ACCCCAAAGC 
412601 TATCTCACAT TCACATAATT 
412651 TGTCGCATTT CAGCAAGAGT 
412701 TATCTAAGTG CTGTCATTGG 
412751 GACTTAGTGT GGTTAAGCAT 
412801 AATTGTGAGA ATAATCATGA 
412851 GGAGAGAGTG CTAAAATGAA 
412901 ACAGGATGAT GGAGACATCA 
412951 TACCCTTAAT TTGATACCTT 
413001 GTCTGTCTCC AAAATACAAG 
413051 GTTCTCACTG GCTTTTATAG 
413101 GTCCAGAGGG AACACAGTTC 
413151 GAAAGATAAC GTGTGATCTA 
413201 ATAGATACAA CTGAGGGAAG 
413251 GATTTATGGG TGTTTATGTG 
413301 ATGGGCTAGA TTTAAGTATG 
413351 CATGGACCTT AAATCTTGCA 
413401 CATGCAAATG TCTCCTTATA 
413451 ACTGGATATG TGAGACTCTG 
413501 ATTCCTTCAC AGCATTTACT 
413551 GAGGTAGGCC TTCTTTCAGT 

413 601 AAACACCTTT TCTTCTTTGC 
413651 GTTGTAATGT TATGCTTTTG 
413701 GACTCCCACT ATGTGGTTTG 
413751 CAACCCCAAA CTATTGTTTC 
413801 AGAATCTTGT GCATCCAGTG 
413851 TCCTTTATGT ATTATGACCC 
413901 GGTTTTCATC CACTTTCCAT 
413951 AGCACCATCT TCCTAAAGCC 
414001 AAGTATTTAC TGGTCCTGAC 
414051 GAGATGTTGG CCCTTATTCT 
414101 TACGCAGGAA ATAATAAGAG 
414151 GATCAGTGCC ATTTCAATCT 
414201 CCACCAAGTA CAGATTTGTT 
414251 AAATAATGGT AAGATGATCC 
414301 TTGAGGAGAC GGTGGTAACA 
414351 CATGATGATT TCACATTGAT 
414401 TTATCTCATT TGATCTTCAC 
414451 CACGAATGTA CAGTTTGGAA 
414501 ATGTTCATCA CAGCACCATT 
414551 CACCGGGTAA AGCAAATGTG 
414601 CAAAAAAAAA AACAGTGAAA 
414651 TGGCGGGCAT TATCCTCAGC 



AATGAATACA TTAAGGAATA AAGTAGCAAG 
AAGGATGAAG GTCCAATTGT TTTGCCAAGA 
TTTCATCTCT AAGTCTGTTT TATTTTAGCC 
TGGATATAAG ATGTCCAGTA AGTGGGTAAC 
ACCTGTTATG AACAAGGCTT AACTTGTTCA 
AAATGTTGAA ATAACGATTT GGGCAGGAGA 
CCACCATAAA AGATATTTAG TATTCAATAA 
AATGTTGTGT CTTAATATAT TGACTCAGGT 
TGGGTTCCAT GATCTATACA CTTAATAAGC 
TATACTTGAA GGGCCAGCTG TCTTTGTTTC 
TGGGCTTGAT CACTAGGACT CAGACTGGTC* 
AGCGAAGATC TTTCCAGCCC TTTCCTGACT 
TTCTCTGACT GCCTGATGCT CTCATGCTGC 
TATCCCTAAA GGATTTGCCT GCCCTAGAGT 
GGTTTATCAA TATTCTGGGA GTCATTTGTT 
CTTATAGAGA AGTCTCTTAT TCACAGTCTC 
AGTACAATGA GTGGGAATGA CAGTCTGGAA 
ACCATCTGGT GCCCAGAAGA ACTTCTGCAT 
CTTATATCAT CAGCAAAGTG GGAAGCTTAG 
TTTAGAAGCC ATTTCATACC TTATAAATTT 
TCTTGATATG TATTTAAACC TGGACTCTCT 
GAAGGACACA AAGTGTTACT GGGTATTTGA 
ATTGTTGAAA AGATGGGACT ATACTCTATA 
CTTCTCTATT TCCCCGCCTG GGATATTCAA 
AAGTACCATA AAAGCTCTGG AGTAGGACTG 
CCATGGTGAA GGTTGTCCTG GGTTGTGAGA 
CTTCCTAAAA GATTAATTCT TACAAAGTTT 
GGCCCTAATA TGGATCAAAC AAATAACAGG 
AAAAGATAAA ATGTCACTTG AGAATGAGAA 
TCCATGCTGG AAACAGAAGA GAGCAAGAAA 
ACGAACGACA CACTGCTCTC GTTTGTGTGG 
AGACAATTTG TAACTGGAAG GACTAAGGCA 
CCTATGTGAC AAAAGGGACC TTGGGGTGTA 
AAGTATGACT GCCTCATGAA AATGTTCATT 
CATAATGAGG ACAAGAAACT GTAGCTGATT 
TATTTCTTAA TATTGCCCAT TTTCTTGTTC 
GCTTCATCAT CTCATCCTAG TAGTCAACAG 
GAGTCCAGCT CTGGGAATAT GCAGAAGGCT 
CTGTCCTCAT AGTCATCAAA CCTGAGAGGG 
CTTGTGTTCT CTGCATCTTA CTGAGAAGAT 
TTTGATTTTG TTTCCTCATA GAGACTTTCA 
GACACGAAGT CAGTCTGAGT GCCACCGCAC 
TGGACCCAGT GTGGTCTTTT CTTTCAAGGA 
CAGGGGTTGG AGGATTACAG AAGTTCTTAT 
CCAAGCTTGT GAGCTGGTCA TGAGCCTCAA 
CTCATTAGGC TTTCAGCCAT CCCTGCAAAT 
CTGGTCATTA CTCCGTAACT CTCCAAACAC 
TAAGAGACTT GGCTTATAGG GGATCTGAGT 
TAGTGAGTTA CAATCTTCTG AGCAGACAAA 
ACTAACAGAA GATAACCAGG TGGTTTGGAT 
CTGCTCAAAG CTGACATACT TACCACTAAC 
CTCCAAATTC AGAAATGAAC AAATAATTAA 
CAGAAAAGAT TAGTGTAGAA GGAGAGGGGT 
GCAAATAATT TGCAGCTCGA GAAAGATGAT 
CAAATTCTAT TTCTATCATA TTTATATGAA 
AACCCTAAGA TATAAATAAT TATCTTAACA 
TCATTGTACC AAAAAGACAC ATACACTCAT 
CATAATGGCA AAGACACGGA ATCAATGATG 
GTACATATAC ACCATGGACT GCTACACAGC 
TCATGTCCTT TGCTGTGATA TGAATGCAGT 
AAATTAACAC AGGAACAGGA AACCAAATAC 
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414701 TGCATATTCT GACATATAAA TGGGAGCTAA ATATTGGGTA GTTAGGGCAT 
414751 AAAGATGGCA ACCGTAGAAA CTGGGGATGA CTAGAAAGGG GAGAGAGGAA 
414801 GGACGGCAAG GGTTGAAAAA CTAACTATTG GGTACTATGC TCAGTACCTG 
414851 TGTGATGGGA TAATTTGTAC TTTAAATCAC AGCATCACGC AATATACTCA 
414901 GGTAGCAAAC GTGTGCATAT ACCCCAAGAA TCTAAAATAA AATTTGAATA 
414951 AATAAATAAA AGGTTAAAAA AAAGAATGTA GAGCTTGGAA GGTTTGATGA 
415001 ACCGTCACTT TCATCGAGAA TACAACTCCA GGAAGTGAGA GTGAAGTTTC 
415051 CAGAAAACAA AGCAGAAAAG CAGCAAGAGC CACAATGAGG GTATATTTTG 
415101 GAGTGGCGGA GACCTATTCT TTGCTTGATC AATGGAAGCA TATTTCTCCC 
415151 CAGGAATCTG TATAATCTAT ATCTCAGAAC TTTCTCCCAG GGGTGGAAAA 
415201 AGAGATAAAT ATCCTTGACT TGTGTCTCCC TAGGCCAACA TTTATCCCCT 
415251 GGGCCTGAAC TGCTTCCGTG ACAGTCAATT CATTTGCAGA TTCCCAGGAG 
415301 CCACAATAGT CTGTAGGGAA GCCTGGGATG GGAGGTGAGA AGCAGGGGGT 
415351 GGGGTGCTCC CAGCGGTGCC TGTGTGAAGT GGGTCTGACC CCACACACAG 
415401 ATGGGACAAG AGTCAGGAGG TGAAGCCAAG AGGATTTGAA ACGAGGCAAA 
415451 AGTGATACCT AGTCCATTAC TGTTTTCTTT GTTTTACAAG TGAGGAAATC 
415501 TCAGGCACAG AAAGCCAGGT AAGTTGTCTA AAGTTGAAAT CTGAGATGTG 
415551 ATAGGGAGGA CAATGTAAAC ACAGAGATTC CAAACTATCG CATCCAAACT 
415601 CCCAACTACT AATTCCTAAT GGAAAGGGTT CCTCTTGAAA ACTTTCAGAG 
415651 AATGACCTAC TAGGTGACAT AGACTAAACC TTCAAAGAGC ATCAACAAGA 
415701 ATTGTAAGAA ACCTGTGAGT GAGAAGCACT TACTGAGTTA CCAGAGGACA 
415751 GTGGGAACTA AGAGGGTGTC TTTAACCTGG AAAGCAACCA GTGAAGCCAA 
415801 CCACATCCTT CAATCCCACA TCCTTTGATG CCCTCACCTC TCACGCACAA 
415851 TAGAGATCTG AGCTGCAATT TGGCACCCTT AAATGTACTT GGCTATGTTC 
415901 CAGAACAGAG AAAAGGCAAT GTTCGTATCT GTTTCACGCT TGCCTCGTGC 
415951 CAGAGACTGC TCAGTATTTC ACGGGAACAA CACTTCAAGG AAGTTAATAG 
416001 CATCTCCATC TTACGGATGA CGAAACTGAG GCTTAGATGG GTAAAGGATG 
416051 TTCTTTGCTT CCCCACATTG TTTTTCTGGT ATATATCACA GGCGTATGTT 
416101 TTGTTTTGGG TCTGTATTTG CCTTCTAGAA TGATATTCAA GAGGTGGGGT 
416151 TATAAACCCT GGCTCGGCAT TAGAATAATT TAGGGATCTT GTTAAAATTA 
416201 CAGAACCCCA AGCTCTACCT GAAACCTAAG GCATTAGACC TTTTTAACCT 
416251 AGGGGAACGG ATATCCTAAT GTGACCATAT TTCACTGGGG ATCTGATGGG 
416301 AAGCCAGGGT TGACACTCAA AAGAATGGAC AGAAAATAGT GAACATGTTC 
416351 CTGAGCTGCA TGAAAGAAAT ATGCAAGATT CAGAACCAGG GCTGGGGGAA 
416401 AAGACAGTCC TGCTCTTCTT TGTACAAATC AAATCACCTC TGAAGAGGTG 
416451 CTTTGAGTTT TGGGTAGCTT ACTTTAAGTA GCAGAATTGG AAGGAAGAAC 
416501 TGTACTGGGG AATAGGACTG TGTCCAGGGC ACCAAGACAA AGCCAGACAT 
416551 AACCCAAGAG GCTGACAATT TGGGGAAGTC TGGTGACGGC AACTCTCTCA 
416601 CCCACAGTTA ACAAACAAAC AAGAAATATT TTCTCCGGGC TCAGTCTCTG 
416651 TAGTTTTGGC AGCAGGGAGT CAGCGTCCTT CCTGAAACAT GCCCTGCTGT 
416701 GTTCCTGCCC AGTCTTGCCA GCAGCTCAGT CTGCATCTGG GCTTTTCCCC 
416751 ACCCTCTGTT GCCTAATCCA CAGCATCCTT ATCCCTCCCC TCAGGCCACC 
416801 ACCACCCCCC TTCTCCCACG CACCCAGGCT GTGTTTTAGC AGGACAGCTA 
416851 TTTTCTGCCA TTCAGCACTG AGTGATCATC ATCAGGTACA GCTTCCCAAA 
416901 CACACAACTG CGCTGCTTGT GTAGCCTCAG GTATAATTAC GAAGGCTGTC 
416951 CCAGCACTCG CTACCTACAC TATACTTAAT GGGGAGGGAT GGTCTCTTGT 
417001 TCTCTATTAA ATATTTTGAA AGAGAACCCG TAGGCATGAC AAGGTTCAGG 
417051 CACCAAGCAG AAATGGGAAT CGGAGTGCCC CTGCTCTGAC AGGCAAGCAC 
417101 AAGTTTTAAT TATCTGCTCA TCCAGGATTT AAAAATTTTC GTTTTACTAT 
417151 AAAAAGCAGA AAAGGATGAC AACATGATTT CATGTGCTTA CAGGAAATGA 
417201 AATTGGTTGA GAATATTGTC TGCTGTCACT TCTCTCTTAT TTCTTCCTTT 
417251 GTAGTGTTTA TTTTTCCTGT TCTATGAATA ATAAATGACT GAAAATATAA 
417301 AAATTACAGA AATGCATAAA GATGAACAAC TTATGACCCC ACAGTTCAGA 
417351 CAAAACTTCT ATTAACATTT OTATATGTTT TCATCCAGTC TTTTTCATGC 
417401 ACACTCACAC ATACCCACAT AGAGACAAAA GAATTGATTG TATTATGGTA 
417451 GTTTAATATT CTAATTGTTT TGTTTTGTGT TACATTATGA GAAAATATTT 
417501 ATGCCAATGG ATACATTTGA AAAACAACTT TTCTAATAGT TCTATATTTC 
417551 ATGTTTCACT CTGATTTATT TCACTTTATT TCCCTGTTAT TGGATATTTA 
417601 AATTATTTCC AATCCTTTGT TGATAACTCT TAATAAATTA ATAAATCACC 
417651 CAAGATATTA ATCAAGCTCA TCCTATGTGT TGTCACTGTT CATGTCAGCA 
417701 TTGTATGAAG AGTGAATGGG CTTTGGTGAC TGAAATGATA GGACTAAATA 
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417751 CTGATTCTAC CACTAGCTCA GGGCATCTAG GAAAGTCATT AAAAATTTAC 
417801 TCATCTGTAA AACAGAGCTC ATGCTAACTG CTTTGCTTAT TTTTCTAATT 
417851 TTGTTCTCTC CATTTCCTAG GACAAACAAA CAATGTCAGA TAATCTCTTT 
417901 CTACATATTC TTACTCATTC TGTTTCTCTG AAGTTGCAAC TACACATATA 
417951 CTCACACAAT TCCTTTGTTA ACACTACCTC ATTCTCAAGG TCAAATCCAA 
418001 ATTCTACATC AACGTCTCCC TCCACTGGTT TTCTTCATCT CTGACCCTTC 
418051 TTTTTCCAGG CCCTGCACGG ATGGCATCAA GGTACTTTCA TTGACTCTGG 
418101 CTCTTCACCT ATGACTTCTT ATGCCTTTGA GTCTGCCATT TCTTTTTCAT 
418151 GGGATTTTCT CCTGACATTC CTGCATCCCT TTTTATCCAG CTCCTCCTCA 
418201 TGACTGAGAA ATCATCTCTA GCCTCAACTC CTCTTGGAAG GATTCCAAAT 
418251 CCCAAACCAC AGGCAGGGCT AAGCAACCCT CCTTGCTGAC TCTTAGGGCT 
418301 GAACCATTTA CTCAGTAAAA GAGATGCTCT ATTTTGAACT TTCTCAATTT 
418351 GAACGTTTGG TGCTTTTTCC CTCATTTCTG GCAACATTTC AATGTTTCTT 
418401 TGCTTTCTCC TTACTATATC TCTAGATATT TCTTCAGCTA TACATGGTCA 
418451 GATGACTGGA GTTCACTTAG AGGTAAGGCT ATCCTAAACC TCATTATTCT 
418501 ATGGTATCTA TTGACAGCGT ATGAATGAGT CTGTGAGGCT TATTACCTAA 
418551 TATTAGAAAT AAAATAAATC ACCGAAAAAC TTGAAGCAAA GTAGTTAGCA 
418601 GTTCATAGAA AAATCCTGAG TTAGATTCCT GGCTCAGTCT TTTACCAGCC 
418651 ATGTGGCATT GACTGAGCCT CCAAGATGTA TGTTCTTTAT CTGTTTACAT 
418701 GAAAATCATA TTTGCTAATT CTTAAAGTTA TGTATATAGA GTTATTTCTA 
418751 AATGACATAT AATAATCACT TAATGATAAT TTTATTAGAA ACTATTTTCA 
418801 GCTACCTGTT ATTTTTACAA AAGAATCACA TTAAAAGATG CTGTTAAAAC 
418851 TATGAGAATA CATGAGATAT AGGTGGTTAT TACTATTTGG CATTCACACC 
418901 CAAATATCTA ATCACTAAAA TTATTATTAA TAAAGAAAAT GAGGCAGATG 
418951 ATCATGGCAT TTATCCCCAA ATCTCTAGCA CCTTGCACAG TTTCTGGTAC 
419001 GTATTTAGCT CCCAGAAATA AACACTGAAT TGAATTTATG GTAAAATAAA 
419051 TTTTTAGAGG CTTTGATATT CCATTTTTAA GCATCCAATC TATTCTATAC 
419101 ACACTCATAT TAGCATATTT ATGTGACAAT GAAACTATTG CAAACAGAGA 
419151 GAAGATTATT AGGTTATGGA TATTAAGTTA ATGGGGAAGA TATAAAATAT 
419201 TTAATAGCTT GGGCTAGACC ACATCTTTTA GAAGAAGAAA CTGAGGATGA 
419251 AAAAATTTCC GTGAACTGCC AGGACACAAA ATTTGAGAGG GGCGGAGTTG 
419301 GAATTAAAAC CCAACCTTTT CTCAAGCCAA TTAGAATGGC AATTATTAAA 
419351 AAGTCAAGAA ACAACAGATG CTGGTGAAGC TGTGGAGAAA TAGGATTGCT 
419401 TTTACACTGT TGGTGGGAGT GTAAATTAGT TCAACCATTG TGGAAGACAG 
419451 TGTGGTGATT CCT.CAAAGAT CTAGAACCAG TAATACCATG ACCCAGCAAT 
419501 CCCTTTACTG GGTATATACC CAAAGGATTA TAAATCATTC TACGATAAAG 
419551 ACAAGACACA TGCACACGTA TGTTTATTGC AGCACTATTT ACAATAGCAA 
419601 AGACTTGGAA CCAACCCAAA TGCCCATCAA TGATAGACTG GATAATGAAA 
419651 ATGTGGCACA TATACACCAT GGAATACTAT GCAGCCATAA AAAAGAATGA 
419701 GTTCACATCC TTTGCAGGGA CATGGATGAA GCTGGAAGCC ATTATTCTCA 
419751 GCAAACTAAT ACAGGAACAG AAAACCAAAC ACCACGTGTT CTCACTCATA 
419801 AGTGGGAGTA GAACAATGAG AACACATGGA CCCAGGAAGG GGAACATCAT 
419851 ACACCAGGGC CTGTCGGGGG TTGGGGGCCC AGGGGAGGGA GGGCATTAGG 
419901 ACAAATACCT AATGCATGTG GGGCTTAAAA TCTAGATGAT GGGTTGATAG 
419951 ATGCAACAAA CCACCATGGT ACATGTGTAC CTATGTAAAA AACCGGCACA 
420001 TTCTGCATAT GTATCCCAGA ACTTAAAGTA AAATAAAATA AAATAAACCC 
420051 AAGCTTTGCC TCCCTAACAC AGTTTAGCTT TCCTTCATAC TTCACCAGAC 
420101 TGATTTATCT ACTCCACATC ACCGGATTTT TGCAGGTCTT TCAAAATCTC 
420151 ATTTATGCAT CTAGGATAAA CTTAAATTTA AATTTAATTT TATGTTAAGC 
420201 ATTGATCAAA CTTGCAAAGT CCCTCTAAGT ATTGTGTTGC TCTTGTTATT 
420251 GCTGTTGCCC TCTCTCCCCA CCTCTAATCC AAATTCCTAA TGCTTGGTCT 
420301 TTCTTTTTAA ACAAAGAACT TGTTTTCATA GAAATGAATC TCTTGTGGAT 
420351 TTTTATGTTA TATGTCAAGC TCCCTTATGA ATCTATCCAA AACATCCAAA 
420401 GAATAAATCC CAGGACAGGA GGAACTTGTA TTCACACATC TAATTAACCT 
420451 TTAATTGTCT ATTGTGACTG CTTTGCAGTT CAATAGTTGC TTGATTTTTA 
420501 TTGCCCGCCT AATTAGTGTG GTTAAAGTGG CTTTTTAAGT CAGATTTTAT 
420551 TATATCAGTT TGCTTACGCA ATTTCGGGAT ACTCCGGGAA GTTCCAATAA 
420601 CATTGTGATT CTTAAGTGCT ATATATGTAT ACACATTTGT TGTTTGTTTG 
420651 CTTTTTCTCC TCAGGAAAAA AAGTGCTTTG ACCAATTTCT CCCACTAGAG 
420701 TGTATGAGTA GTCTATGAAA TTCCTTAATG ATAGCAAAAG CCTTTTCATT 
420751 AATTGTGCAT AAATTATCAA CACAGCCAGT CAATTCTAAT TAATCATTTT 
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420801 TCTACCCAAT AAACCACCAG ATTGACAGAT TTAGAATTCT GAAACAAGGG 
420851 GAGCAGTGAA GGAAACCACC CAAAATACTG CAATTTAAGG CAAACTAACA 
42 0901 TATATGCACA TCTCTAAAAA TCAGATTCTA CAGCTGTAAT TTATTAGTGA 
420951 GTCCATGGAA TGTAGCTAAT TGAGTATGCA TCTATCAGGG GATGGTTAAG 
421001 ATTTAAAATA ATCAATTAGA TGTACAAATC TTATTGCATA CCATGGATCT 
421051 TTTTTCTTCT TTTTGTTTGC TGTTTTATTT TCTAAGTAAA GAATAAACTT 
421101 GCCTCCAATT ATATAATTTG TATTTTCACA TGTTAGCATT CTTTTCTTGT 
421151 TCCACAAACT TTCTAAATTA CTGAAAATTA TCATGAATGC AATTTAATTT 
421201 TCTACATACT GAGCACTATG GTTGTCTCTA ATATTGGCCA CGGTTGTAGT 
421251 TTCCCTTAGA CTCCCTGAGC TCTTGCTTTA TTTTCTTTGT CTCTATTCTC 
421301 TTGGCTTTAA AATAGAGTTA AGCAGTTTGA ATTATTTCTA GATCTCTGTC 
421351 TCTCTCTGTA TTGGAGAGAC AAAAGATCTA GTGCTTAGCA GAGTCCTCAA 
421401 CTCATCACTA ATAGTCAAAG TATTCAGTGG CAATTGCCCT GTGTATGAAA 
421451 ACAGCAGTGC AATACAGAGT AAAAGTACTG GGGCATGCTG TCTCTTGCAC 
421501 AGAACAGGAA CCAAGAGTAA TGGGTTCTAT TCCCTACTCT ATGTTTGATT 
421551 TACTGGTTAG CCTTGGGCAT ACGATGCTTC TTACCCTTTC AATGCGTTAT 
421601 TCTTTCCATC TGGAAAACAA AGATAATTGT ACTTGTAGGG AGGGAAAAGG 
421651 GAGAACCTGA CATAAATTAG TTTGGATAGA AATTATTTGC AAGATTCAAA 
421701 AATGATTCTG AAATGTAATC CATTCCACAT TTATTGGGCA TTTCCTATGT 
421751 TCTAACAATT GTGCTGAGTG CTAAAGAATA CAGAGTTTGC CATTACATTG 
421801 GTGCTCATTA TCTGATAAGG TGTAGCCATA TAATGGTAAT TATAATATAA 
421851 TGTGAAAAGT TAATATTGAT GAGCCCAACT ATCTATTTCT AGCAGAGAGA 
421901 ACTGAACTAA AATTTATTTG AAACATCTTC TGTAACTGTT GGCTCATCAT 
421951 AAATCAAAGT GAATTGCATT TATAAGGATA GTTTGTCTAT TAGTAATCTG 
422001 AATGAAGGAA GACAGGGAAA GACATCTGGA AAATTACCAC AGTAGACTGG 
422051 ATTATCGCTT GTAACTCTTT CTTCCCCATT GTGAAAGAAT TACACATGAA 
422101 TTGTCTTTAC CACTTGACTT TGTAGCGACT CCCATTACAA TTGACAGAGT 
422151 GTGCATCCGT CTTCCATTGA CTGTGGTTTT GGTGTAATGA CTTGCTGTAC 
422201 CCAGTGGAAT ACGTTGGGCC AATGAGGAGA CATTGCACCA GTTCTGAGTT 
422251 GAGTCTTGAA GAGTCATGGG AAGTTTCTGC CAGCCTGATG GGATTTCCTA 
422301 CCATTTGCCA GGACAAGATC ATGCTTCAAG TAGCTTTGCT CCTTGGCTTA 
422351 TGAAAAGAAC AAAGAAATGC TGCACAGACT GAACTCCACC AAAAGCCTGG 
422401 ATTCCAGTCT AACCCAGATG ATACCAGTCT AGACCAGCTG ACCCTCAGAA 
422451 ACACTTAGGA GAACAACATA AATGCTGTTG CTGTAACCTT TGCAGACGTG 
422501 GGGACTGTTT GCTCACAACA TTATCACGGT TAAGGAAATA AGCTGGTTAA 
422551 TAAAACATCT TTTTGTAAAC TCAAACAAAA TTATTGACTA AGGAAGTGGT 
422601 CATCAATAAA TGATAAAACT ATTAGGTGAG AGGTTGATGG GAAATTGAGT 
422651 CTTTACATGG TGTCAGACTG TAACTCCACA GTTTCTTGCC AGGCAGAGAG 
422701 AACAGAC ATA ACATCAATGG AGAGATCAAG CTTTCACTAC CTTAGTACAG 
422751 TGAGTCACTA ATTATGTGAC AACCAGACAC CATAATACAG CATATGATGA 
422801 CACATCAACT TCTCAGCATG ACCTATGAAG TGTTTTTGCC AAAAGTGATT 
42-2851 AACCTGAAAT TAGCAGAGTC CTAGGTCCCA ACTCTCCCTT TACAGGAAAT 
422901 ACAAGGCATA GAGATAGAAA TTAAATACCA CCAGTTAACA TCAGAAAAAT 
422951 TCATCAAGCA CACGTTCTAT AAGACAACTG ACTGGTTACT TCAACAAGTC 
423001 AATGTTACAA AAAGAAAAAG TGACAAAGGA GTGGAAAAAA CCATATTACA 
423051 ATGCAATGTG TGATTCTAGA GTGGATCCCA GTTTGAATGA AAACAGCCGA 
423101 GAAAGACACT GGAAGACAAT TGAGGACATT TACGTATGGG TCAGCTTTTA 
423151 GAATCACTCA GAAATTATTA TATGGTTAGG CTTAATATAC TTAATATAGT 
423201 ATTGTAGTTA GTAGAAAACT GTCCCAACTT TTAGACATGT ATAAGGAATT 
423251 ATTTAGAGTT TATAGCTTAT ATGTAAATAA TTGACTTGAA ATTACATAAA 
423301 AACAGATACT AAAATTATGG CAAAGATGTT AATGCTGATT AAATTTGGAG 
423351 AAAATATATT AAATGTTTAT GATTCTATTC TTTCTAAATT GTATATGTTT 
423401 GAAAAATTCT AAAATAAAAA TGACCAGTGG CAGTGTACTT AGGACTCATG 
423451 GTAAAATCAT TTGAGATGTC ACTCTGTGTG CAATCTGGCC CTATGTTTAG 
423501 TGTTGTTTTT AGTGTTCTCC AATTATTTCA CTCATTTAAA GGCTGTATCC 
423551 AGAAGGGGTC AGTAAGCTGC ATAAAGATAA GCACCAAGTT TTATACTTGT 
423601 TTTCACTTAG CCCAAACACT CAGTGGGTAT GCGAGTAAAT TCATTCCATT 
423651 AAAGATGATA GAAAGGACTT TCTTTTCATT ATTTGCTATT ACATTATGTG 
423701 ACTATTTATT GCCCTCCTTT GTTCTTATCA GAGGAAGCAG AGACTCTGGT 
423751 TGTGTGTCTC CTTATCCTAT GAATTAGTTG ATTAAATCAG TCCAGGCAGT 
423801 ACCCACTTGA AGCAATCATG AAGCCAGTCA ACCGGCTGGT TAGAGCTAGT 
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423851 CAGAGATCAC CCACCACCAC 
423901 TACAGAATGG AAGAAAACAG 
423951 GCAGTGTATT TTTTAGGTCT 
424001 GAACATTATG ACTTAAATAT 
424051 TAATAAAAAG GTGTCCATCA 
424101 TTTTTTTGAA TCCTCTAATC 
424151 AAATACCAGA CTTGATTGAC 
424201 TTCCAAAGGA AAACATAGTC 
424251 CATTAATGAC AAGTACACAG 
424301 TAAGGTTTTT CCTGAAAGAG 
424351 ATAATTTTTC AAAATATTAA 
424401 ACCACATTTC AGGAAACACA 
424451 GTTGTTGGTG TGGGGAGGCG 
424501 CAACATTATT TTAAATAATA 
424551 ATATAATCAA AACAAAAGTA 
424601 GAAGCCTTAC ATGGAAGACT 
424651 AAGGGCATTA GGTAAATTTA 
424701 AAAATATTGA ATCAAATACC 
424751 TACATATATA AGAGAAAGAA 
424801 TCTGGATGAA AGTCGTAATT 
424851 GTCCAGAGTC AAGTCTGGAC 
424901 ATAATATTTA ACAAGGAAGC 
424951 TGTTGTGAAA TGTTGACAGG 
425001 ATGATTTATA TATTGTAGTT 
425051 GGAACAGTAT GGATTATGAT 
425101 ATCCACATGT ATTATAAGGG 
425151 TTCAGTGGCA GTTTAGAAAC 
425201 AAGCACAGAC ATATTTAAGT 
425251 TATCTATATG TGTGTGTATA 
425301 TGTGTGTGTG TACACACACA 
425351 ATCAGAATGT CTTCAACATT 
425401 TGCCTCATGC ATCAAAAATA 
425451 AAACCACAAC TCTGAAAGGA 
425501 ACCTGAACCA GATAAAATAA 
425551 GCAGCATATC TCACCACTCT 
425601 GCTCTATTCC AACATGCTTG 
425651 AATACATTTT CTTCAGTTTC 
425701 TTAAAGGATC TGGGTGGAGG 
425751 CCCATCCCTA CTGGCACCAC 
425801 CATGAAGCTT ACTTTTAAAA 
425851 TTTAGCCAAA AATAAAAGCC 
425901 ATCAATTATT ACTTAAAAAG 
425951 CCTTCTACAA CTTATTATGT 
426001 TTTATTAGAC TATAAAATTT 
426051 TCTTCCTCAG TGCCCAGCTA 
426101 CAAGTCATCA ACAGGCATAT 
426151 TAAAGAAACA CAAGTGGGTT 
426201 GAGTAGATCG GTTTTTCTAA 
426251 GAGAAAGAAT TGAGGTGGCT 
426301 TCGCCTTATT TGAAATAATC 
426351 TAATGGACTG ACATTATGAT 
426401 ATATTTAATT AAAAGTAAAA 
426451 GTATGTGCCT GCCAATTTTT 
- 426501 CAATTGACAG TGAAAATTTG 
426551 TGTTTTAAAG AGTATATTCA 
426601 TAATTGGCAA ATTCAAAACT 
426651 CTGCAAAAAC CTTTATCTAT 
4267 01 CAAAAATGCC GCATCTTGGT 
426751 ATCTGTGTAT AGCTTAGAAG 
426801 TCATCGAATC TGGTGGAATT 
426851 GAGAGTATCA GTGACTAAAT 



TGCTCCAGAT TTTCTCTACA TGTCAGATAA 
ACTGTGTGTT TAAATACCAA ATGGGTCCAC 
CCATGATGGG CCAAATATAT TTAGAAAGGA 
GTATCAAGAA GCTCTCAACC TTGGTTTCAG 
TGTTAATATC ATTGGATTAA TTGCACCACT 
AAATGTCTCA TGCTTGTCAT GTAATTGTTG 
AGATATTGCA AATTCCCCAA GGGTAAATGT 
ATTTCAATAC AGCAATTAAT TATAAACAGC 
GTCTCTCTCT CAACACAGAG GTCAGAAAAG 
ATGAAAAGTT GTGGTTACCA TCATACAGAT 
TGAAAATGAT ACAATCATTT ATTTTCACCC 
CTCAATACTC TAAGCATATT TTTTAACAGG 
GTGCTTTTGA CTAAAAGTCT TGTATGCTAG 
ATACTAATAA TAGTTCAGGT GGGGATTAAA 
ACAAATCAGT AAGTTGGTAA CATGTGAACT 
ACATTTTTCA ATGGTTTTTT GTTTTCTTGG 
GCATCAGATG TCATCAGTAT AAAATAATGT 
CTGAGAATTC AAGGCAGAAG TAGCACTATA 
AGGGAGCAGG GCGGCTTTGC GTTGTAATTG 
TCTACATCAC TCTTAATGTA CAGAATTAAG 
ATTATCAGGA ATTGCCACTA CTGATTAGCA 
AATGGACCAC ACATGGGAAA GGAAAGACAC 
AATTTATTTC TTGTCCGCAA AACTGGCCGA 
AGGAATCAGA CGAGAACCTT ATGACAATTG 
GAGGAAACAG GAAGCGTTTC AGAATATGTT 
TGACTTTTAA GGAAAAAAAG AAGTAAACTT 
AGTTGAAAAC ATGCAAATAC AGTTAACAGT 
CAAAGCTAGA AGTATTTTAA AGAACCTAAA 
TATATACATA TGTGTGTGTG TGTGTGTGTG 
TATAGTGAGA GAGCCAGGTG AAATCCAAAC 
TTTCCAGACA TCAAAATACT CGAACAATTT 
AGAATTCTAT CTTGTCTCTC TTCCCACTCC 
TTTTCCAAAA AATTGCTTTT ATCATGTTAC 
AATAAAATTA AATTAAAATA AGCATTGATG 
ACCACATCAC3 GTCTATGATC GCCAGGTAGG 
TCAATATATG ATTCTCTTAA ACTCTATATG 
TACTTGCAGT TCTTCTGCAG GGGGCTCAGA 
GACAATTTCT ATAAATATCA TCTTTATCAC 
ATTCACTCCT GTTACTACTA ATACTACGAT 
TTTGAACTAG CTTTCTCCAA TCCTTTCAAC 
TCATAACAAA CTGACCCATT AGGGAGGTTC 
GTCCTAAAAA TCTCGGTTTA AGTTGGAATT 
TAAATTTAAT TGTTGACATG CTCATCTTTT 
CAATGGCAGA AATTATGTGT TGTTAATTTC 
ATACTTAGGA AAGCATGTTA TATAAAACAA 
ATGTTACAAA AAAACTCAAG GCAAGGAAAA 
TTTGGCATGT CATTTTCAGG AAAAAGCAGT 
AATGAAGCCC TCTGAGTTGG CATGTAGAGA 
GTTAGGATGT TGCCTACAGG CTGTTAATTA 
ATTTTACTAT TCAGAACATT AATTTAATTT 
GGTTTAATTG AGGTTAAACA GTGAAGTCTT 
AAACCTATTA TAGAGTATTA TAAAAACCCT 
ACCCCAAGAA TTAGCTAGAA ACTCTCTTTT 
TTCTGCAATA TCTATTTGGT TTTACGTATT 
GAGAGTTTCA GAGAAAAGTA CTAGGCCAAT 
TCAGAGTTTT GGCCCTGCCC TTCAGCCCCA 
ACTTATAATG GACTTTTCAC TGCAGCTGAA 
ACCTTATGCA TGATTTTCTT ATCCTAAGAT 
CTTTGTCTTT TGACAGTAAC TACATGACAT 
GGTAAAATAA GAATATTGTA TCTCTACCAA 
CTTATTTTTC AATTTTAAGT CATTATTAAG 
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426901 ATCCTAAGAA AAGCCCAGCC AAAATATGGA GACAGATAAG AAGCTCTGTG 
426951 GGGTTCTGAG AGAAGAGATG AAGCTGGCCC AACCATTGTC TGATTTTTTT 
427001 GCTCTCCACT GAGAATGAAA CACATGGAAA TAGGATTGAA AATGAAACAA 
427051 GAATCAGCAT TGGACTAACA CAGCTGGATG TTAGGCAAAT GCTTATTATG 
427101 TGTAGTGTTT ACTTTTCCAG AGGAGAGGGA GACAGTTGTA ATACTTAAGA 
427151 ACCAAACTTC TAGAAATCAA ACATAATTCT ATTTATGCTT CTCCATCCCC 
427201 CAAACAGTAA AGCTCACCTT TGTTAATGTT TCTTGCTTTC TCAACTGTGT 
427251 TTCTTGCACT GTGCTAATAT GCCTTTTATC TACACTATGT TATTTAATTC 
427301 TTTTATCCAC CATGTAAGTC CCTCACTACA AGTTCTATAA GGCTGTGTAA 
427351 TTTGCCCAAA TTCACACAGA TTCTAAATGG TGAATATTGG GCTCATATCC 
427401 AGGTCATTCC CCAAAGTCTA ACCTCTTAAT GACTATTTCA CCTGCTAATG 
427451 AATATTCCTA CATTAAATTA TAATACTTCA TTGCAAGTAG GTTTTTTGTG 
427501 TGTGACTGAA TCCAGTGGAG TTAAAAGGGA TCTTCAAAAG GCACTCCGAG 
427551 CACCCTTATA GTTTCAGCAA AATGACATGA AACCTCTCTC CTTGTACCTC 
427601 TCAAAACTCT CCTCCTCTCT AAATAAATAA ATAAATGGAT AGAGGTTTTT 
427651 TCATTTTATA GCACAACTTT CTTTAGTTAA CCATTCCAAT ATTTATTACC 
427701 TCTTCTGCTT GTGCCTGCCT CCTTGTGTGT AATTCAATAC CCTCAAAATG 
427751 CAATTTGATT CAAACAGCTG GTCCAGGGAT GCAAAGTGCC TGTCTTATTC 
427801 TGGGTTGAGC GAGTGACTAG CCCTGGGCTG GCTGCTGTGG CAGGGCACAA 
427851 AGAAATGACC TCAGCCCATA AGGAGCTTAA AAACTAGTAG ATAAAATGAG 
427901 GCATTTACAC ATTCAGCATA TAGAGGCTAT ATTGCCTGTG TGTAATGAAG 
427951 TGGTAAGTTG AGTAATAGAC TAATACTAAC CAGATTTCAG ACAGCCCACT 
428001 TGAAGAGACA GGAGTAGCTG AGGGTGTCTT CCTAAATGAG ATGAACCCAT 
428051 CATTGGCCTT GAAGAAAACT AGCTTTTATT GAATGCTCTC TATTTGGCAG 
428101 AAAGCGTGGT ATATGGTAGA TACTTCACTT ATACTTGCTT ATTTGATCTT 
428151 CACAAAATGA GCATTATTAT CTTTTTATAG TTTATAGAAA ATATGGTTCA ' 
428201 AAAGAAAGTA TTGACTGTGT AAATCCACAC AGTAAGTTTT TGAATTAGGC 
428251 CCTGTCCAAC TTATCTGAAT CTGTCCATCT ATCCATCCAT CCCCATCCAT 
428301 CTATCTTTTT TTTTTTTTTG CTTTTTGCAT TACTTTACAA TTGCCTCCAT 
428351 GATTGCTATA TCTTGTGCAG TAATATTTAC AATTTCTTGC ACAGAATCCG 
428401 TTGCTTTTTT GCTAATAGCT CTTCTGCTTT TCTTTGTAGG ACTGTCCCAC 
428451 CTCAGTTCTC AATGTGTGTT CTTCAGGTAG GGCTGAAATC TTCCCATCAA 
428501 CTCTAGGGGT GAACAATCAG CAAAATAAGC CTCGCTGAAC ACAGCGATTA 
428551 TTTCAGCTGG TATAATTGGT CATTTATGTG TGTTGAGAGA AAAGGGCCTT 
428601 CTTTCTGTTA AGGATCTCAA CTCTGAAAAT AGGAGTCCAT GGTTTTTAGT 
428651 TGCCACCTTG CTACCGAAAA ATGAGAGGGT ATTTGAGAAA GACATCATCA 
428701 GAGAAAAAGC AAATGTCAAG TGACAGGGAC AGAAAGATTT GGGCACTAGC 
428751 ATTTGCTTGC CTGGTCATGC ATAAATTAAC TTTTTATTTT TGTCTCAACC 
428801 AGTTGGATTT GATGTTCTGC CATATAAAAT CAAATGAGCC CAGGCTTACA 
428851 TGGAGGGTAT TCCAAGAGGG GTTGACAATT TGGAGGTGAC AGGATGAACC 
428901 AAGGTTAATG ATGAGGAAAA TGACACGTTT TAAATGAAGG CATTAAAAAT 
428951 GTAAGGCTAA AGTGAAGTAG AGGTGTTACA TAGGAGACAG TAGAGCCCAA 
429001 ATCCCTTGCA CTAGCTTTCA TTCTTTGCAA TAAAGGAGAA TTCGGTTGAC 
429051 CCATGAGGAT GGTGCAGTGA CATTTAAAGC ACCACAGCAA AAATGCACTG 
429101 AATATTAAAT GCACTGTAGA AATAATGTGC ATCGTATGTT TGTGATAAAA 
429151 AGAAAGATAA ATATAATTGT AGAGACTCTA TCACAAATAG AAAATCTCCT 
429201 GTGAAATACA CTTATAGAAA CGGTCAATGT TTTATGCATA TTAAAACATT 
429251 CTCCAAGATG ATACAACTGC CAGCAGCACC ATAGTTCATT TTAAAAGACA 
429301 AATACATTCT AAAAATAAAA TAAAACTAGC AAACACATTA TTTGTTCAAA 
429351 TAGTATTTTG AGGTCAACCA AGATGCACTT CTAGTTCCTT GCAATTTGAG 
429401 GGATTTTTAT AAAACAGACT TTCAATTGAG ATTTTGAGAA ACACCAAGGC 
429451 AAGTTAAAAT CTTCATGACT TTATTCAACA AAAAGGTGGC AGCAATTTTT 
429501 TAAATTTTGT GTGAAAAGTC CTACAACAGC AAAAATGACC TATAAAATAG 
429551 TTCAAATAAG- AGCCTCATGG CAAAATGACT TTTTCATTCT CCTAAGAAAA 
429601 GTCAAACAAA GTATAAATCT ATTCAAACAG ATGGACTTAA ATTGCAAGCA 
429651 TTAATTCAAT TCAAACTTTG GTGCATTGTC TAAAAATCGC TGTTGAGAAA 
429701 TAGCTTTAAG TCTGAATTAT CAAAATTAGT AATTAAGATT CCTAAAACAA 
429751 TCCAATTCAA CAGTTACTGA GCAATTACTA TGTGCCTTAC TCTATTTAGT 
429801 ATTTAAGATA CAAATAAAAT TCATCCAGTC CACCAAGGAA TGGCCTTAGT 
429851 ATCCTCGTCT GTCATATAGG AATGATGATG GTACTCATAG GATTGTTAGA 
429901 ATGAGTAAAT TAGTTTAATA TACATAAAAC ACTTAAACAG TGACTGGAGC 
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429951 TTGACAAATA CTCTTTAGAG GGTCATTATT ATTGCTCTGT GAATAGGTAT 
430001 ATTCACAAAA CTGTGGAAGC ATTGAAGAGG TAATGGTTAG TTCTGTCTTG 
430051 GGACTAGAAA AGTCTTCACC AAGAATAAAT AATTTAAGTA AACCTTTAAG 
430101 GATGAGTAGA AGTTCACTAA ATGAAAAAGT GTGGGTGGCC TCTCTAGGGA 
430151 GAGACTGATG GGTCAAAATT CTAGAGCCAC AGTAGAACTT GATATGTTCA 
430201 GGAAGTGATG AACAGGCTGC ATTTGATTTG AAAGGGTACT GGAATTTTAT 
430251 TCTCTAGGGC ATAGGGGATA ATCAAATTTG TCTTTTTTGG CAGACTAGTG 
430301 GTATAACAAG AATTAGTTTT AAAGTTGCCC CTAAATTATC ATGTGAATAT 
430351 TATCATGTGA ATATACATGA GTAGCATCAT ATGAAATTAG AAGCACCTGG 
430401 CTTCAAACCC CATCTGTCAC TTAGTGACTT CTCTTTCCTG AAAATATAAA 
430451 TAATAATACT TAATTTGGAG ACCTTTCACT TAAAAGAGAC ATTTTTAGAC 
430501 TCCCATCATG TGGCAGGCAC TCTGTGGTGC TTTTCTATAT ATATGGTGAT 
430551 AAGTATATAA GATTTTTATA TTATAAAAGT GTCAGACAAT ATGATTACTA 
430601 ATGTGTTAAT ATCCATTTTA AAGTTTTTTG GTTTAATTAA TAAACATCAG 
430651 GCTTGGTTAC ACTCATTAAT GAAAGAAAAT ATGAAGCATA ACATAAGCAA 
430701 TTAAAATAGT ATAACAATAG GATAATAAAG CACATCATCC ATCTTCAACA 
43 0751 GCCATCAACT CACACCAACC TTGGTTCATC TGTTGCTCCA CTTACCTCCT 
430801 CTACTTTCTG GATTAAATTG AAGCAAGTTC CAAAGAACTA TTAATTCATT 
430851 TGTAAATAAT TAAATATGCA TCTCCAAGAG ATAATGACTC CTTGAAAATA 
430901 TCCTAATATC ACATCTAATA AGGAATTAAC AATAATTATG TAATATTTTC 
430951 AAATATCCAG ATTCTGCTCA AAATTATATT AAATTTATTT GTCTGAATCA 
431001 AGATTCAAGT AAGATTCATA TGGTAGTTGG CTGACATTGC TAAAGTCTTT 
431051 CTTGATCTTT TCATCTCTTG TTTCTGTGAA ATTTTTATTT TGTTGTTGTA 
431101 TTGTTGAAGA CACGAGGTTG TGTGTCCGGC AGTTTCACCA AGTGCAGATT 
431151 TTGCTTATTA TATCTTTGAG GTGTCTTTAA CAAGATCTTC TAGCCCCAAT 
431201 ACTTTCTGTC AGTTGTTGGT TAGATCTAGA GGTAGGATCA AAGTTAGATA 
431251 TGTCTTTTTA AACAGGAGTA CTTCACAGGC TTTGCTGTGT ATGTTTATCA 
4313 01 GGAGGCACAC TATGTTATTT GATCTTTAAT CTTGAGATGA TTGGTAATTA 
431351 TCATTTAGGA CTGACAAAAA AGCAATATTT TAATTCTATC ATTTCTTCTT 
431401 CTTTTAAGTT GAAGTACTTC TATGAACAGA AATCTTCCCT TATCAGCTAC 
431451 TTGGTTGAGA TACAACAGGT AAATTATTGA TAATTACCCT TTATTTATAA 
431501 GCTTTTAAAA TCGTATGTTG ATACCTCAAT AGTCTCCAAA GGTAATCAAA 
431551 GAGCCTTTTA AAATATCATT ATGAATGAAT AAATTGAGAA ATATCTGAGA 
431601 ACACCAAGCT TACTAGAGCT CAGATATTTT TCATCTTTGG GCAGTGAAAG 
431651 CTAAGTCAGG TAGGCTCTTG AATACTTTTG GGAAGTTTCT TGTCAACATA 
431701 TAAGATATTC CAGACTCATG TCATATATTT CCTTCTGCAG ACCTAGAATT 
431751 CATTATTTCT CCAAGAAGCC CTGGGTTTCC TTTTAGTGGA AAATAGAACC 
431801 TATCTAGAGA CCAAAATCTA GGAGTGCAGA ATGCTTATTG CTATTGGGTT 
431851 GTTCATTATT TCTAGGTCTT TACATTGACA GAGCAGAGGA TACTTTTTTT 
431901 TTTTAGAAAA TAAAATATAT TATAAATTTA TACTGATGAC TACAATTCAA 
431951 ATTTAGGGTT ACAGTATTTT AATTTACCCT CATCAATCCT TCATTTTTTT 
432001 TTTTACTTCT ATGTCAAAAA TCTTGGTTTT CAGCAACATA AATGTGATTA 
432051 CTCATTTGCC TTAATCCACA CTACATATAA AACACTCACT ACAAAATTAC 
432101 CACCACCAAT ATAATTATCA AAAACACAGT TTATGGTTTA TAGTTCCTGA 
432151 AGTTTATTTT GTCCTTAGGG TATATCTCAG TGTAGATATA CAGTGAAAAT 
432201 ACTGTGCTTT ATAATCATTT GCAATAGTTC TTCCTTCTAT AGTAATTCCA 
432251 CCAACTGGAT ATTTAATGAA GTTTATCTGT TTTCTTTACT TTCAGAATTT 
432301 AGATATTGTT TTTCAAATCT AAATTACATT TAATAATCAC ATAAACTATA 
432351 TATGTGAACT CAGAGTCAAA TCTTCAAAGA CTATTGAGAC ATGAAAAATG 
432401 TTGGTTGAGC TGCCACGTTT CACCAAGGGA ATTTGGCGGG GTGGTCACTA 
432451 AATTCATGCA TTTTTGAAAT TTCTAGTTTT AGAAAAAGAC TGAGGGCTCA 
432501 CCCATCTAGA AGCAGATCTA GTTTGAGAAT GAAGAAAGTT GGCTGGCACC 
432551 AAGCACCAAG GGACAGTAGC AATGTGAAAA CACTTCAGAA ACTGCTCAGA 
432 601 TCATTTTTGC TGGAAGAAAA AGTACACTAC CAAGTAATAT TTTAATAGGT 
432 651 ATTTATTTCA TCGGAAATTT TAAAAAATCA GCCACAATTT TCTTCTACCT 
432701 TTGTGAAATT TTGCGTGTGA TTTACGAGTT GGCATTAAGA GATCACTTAA 
432751 TACTTGCTGT TCTGCCCTAT TCTTTGGCAT TCTATCATAA ACATTTTCCG 
432801 TGTGGTAGCA TATTCTTCCT ATTTTCATTT TAATGGCTGC ATACTATTTC 
432851 ATGGGGTGAA TGGACCTTAA TTTAATTAGT CTGCACTACC GTAAAATATT 
432901 TAGGCAGTTT TCCCTTTTCT TTTTTTTCTG TTAQAATTAT CTTCCAGTGA 
432951 ACACACGAAG CTCCATAAAA GTAATTTTAA GCGCTTTGCT TTAGCTGTGG 
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433001 CAGCAGTCGC AAGTCGTATT TTTGATCATC TGGGTGCACA ATACTTTACC 
433051 AGGTCTTGCT CACCAGTTAG ATAAAATATT TGCTAGTTAT GATGCAAGGA 
433101 AGAGCAAAAA TAGTAAATAT TATTTTGAAC CAGCTGCCAA GAGAACAGAT 
433151 GGAATCCCAC AGTATCCAAA ACATTTGACC TGAAGTT£TC CTCTGTGACC 
433201 AGATCCATTT CTTCTCAAAA TACATTTTTC AAGCAGTAGT TCCCTGACCA 
433251 GTAGAAAATG CTGCTTTTAT CTTTCTCTTC TTTTGTAAAT CATACTGTAT 
433301 GCTATCTTGT TAAAACACCC CATGCCAGAG AAGTATGAAT GAATAAATCT 
433351 TCCTGCAGAT CTAGGTTTCC TTCAGGCATA ACTAAACCTG GCAAACACAC 
433401 TGTTAAAATG ACCAGCCACA GCACTGGACC CCAAGAGAGC TCTATGGTCT 
433451 TACATAATAA CTGCACAACA ATCAAAGAGG CCAATAGAGG GGGTGAGCTG 
433501 GGATTTTTTT TAGGTAGGGG GAGGGAGCGG AAGTAAAAAC AGAAGAAAAT 
433551 ATGAGCTGCA GGGCTAATTG GATGTAATTT GGAGATTTCA AAGAACACTG 
433601 ACTACAAGGT TTGAAGAGGT GCAGTATTTT TGAATGATGA GGCACCAGAA 
433651 ACTCCCTAGG TAGCATTTTG AATTTAATTT - AAATGAGAAT CCAGTATTTG 
433701 TGCAAACCTG GGGCTGTTAG GATGCAAGAC TATCTTGGGA TTGGCATTGG 
433751 CAAACCATGC ACAGAGCTGT GATGAAGAAA CCCTGTCAGA AGACGGTGCA 
433801 TTATGGCTTT AGAGAGTGAA TTTCTAGACA GTCAAGGTTG TGTGAAATGG 
433851 GGTACTTTCA GCATCCCTAG AGATGATGTC TTGGTTGGTT GGATCTTGGA 
433901 TTTTGCAGAT TACTGATGAA AACATTTTTT GGAGATATAA GGGACTGAAG 
433951 GCATAATATG AAATCTTACT TTCAAAAATT TCTATCAGTT AAGTTCAAAG 
434001 AGTCATTTTG GGGCTTATGG GGATTTATAA TATAAGTTTG TGACAGGTGA 
434051 CATTATTTTC TTATACTGGG CCTTGAATTA GATTGAATAA TTAACTGTGA 
434101 TTTAAGATTT GAAGAGGCCA GTATTTTCTA AATACTAAAA CAGTGGTTTT 
434151 GGACGTTTTA TGCTTCAGAG ATTTAATTCC TGCAAGATTA ATTTCTTTGT 
4342 01 AAAACCCAAA AAGTATCTAT ATTTTGAATC AGACATCCAG ATTCTTTAGG 
434251 AAGCAATTTT TCACCAAAGA TAATTGCTGC TGAGATTTAA ATGGATAGAA 
434301 TCCTAGTCAC CTTGGAGCAT GAAACATGTC TAATTGTCCC ATCATTTTAT 
434351 TAGAATATTA TTTTATTTAA AATTTTGCCA ACTTCATTAA GACATAAAGA 
434401 CCAAGTGAGA AAGAGCAATT CATCTAAAAT AATTAGCTGG AATTTTTCAT 
434451 TCATGGGTCT TTTGCCAACC TTAATCATTA ACCTGGACTT TAACAACTTG 
434501 TGCATTCTTG CTGACAGCCT GACTCTCATC TTGAACTCTC CAACCCAACT 
434551 GAAGTTTTTT TCTTGAGATA AACTGCACAC TGACTCTTCA AGTATCTGAA 
434601 CGGCGGGGGA AAGCTTAGAC AAATAGATTA GTAACAATGC TCAGGTGTGT 
434651 TGAGAAATGG GCGAACGAAC GGCTGTCCTT GACTAACTCT CTGGATGCTC 
434701 AGAGAGACAT TTTTATTGTT TCGTGCAGTC CTTGTTTAAT ACCATGTTTT 
434751 GAAATACACT GCAAAGTTGA CAAGACGAAC AGTGGAGAGG TGGAGGTCTT 
434801 CTTATTTCTT TGTCTTTGTG GCAAGTGTCA AATCTCACTT TTTCCCTAAG 
434851 ATTCCTAGAA TGCACAGCTA AGAGGAAACA TGAGAATTTA TCTAGAATAA 
434901 TGGCTTAACA ATTATTTCAT CATGACACAT GAGACAGCTG GTGGATCATG 
434951 TTGTGCACCT CCCATGGGTT ATTTTAAATA GTTCCTCAGG TGTTTGGAAT 
435001 GCGCAGCCAA ATTTGTGAAT AACTGGCTGG TGGAAAGAAG ATGGCACCAT 
435051 ATCCTCGTTG ATAAGATAAA AGCTTTAGAG TAGAAAATTG CTATCATTAA 
435101 TTTTCCTTCA AACTTTCTCT GCCCCCTCTT GCCTCTAAGT CAGATCAATT 
435151 GTCAGCCTGC TTCTCTGCCT TCGCTGAATT GACAGTTGCT AACACACTGC 
435201 TAATGGCTAA CTGGGACTTA TTGATTACAG AAAAACGTTT GGCTCATTCA 
435251 AAATCAAAGT GATAACATGT TTGAAAAGAT GCTTAATATA TGTATTTAAA 
435301 AACAGCTCTC ACAAAAGATT GGTTAAGGGC TTTAAGATAC AAAGAGCTGT 
435351 TATTAACAAA TAAAGTGGAC TTGGGTCGTT AACGGGTAAT GTATAAGAAA 
435401 ATTAAAATGG GTGATAACAT TTTAAAAGAA GTTTGCCTTT ATTAATAATA 
435451 AGATAAATAT AATTTCAAAC TATAACAGCA TAGCACTGTG CATTAAATCT 
435501 CATTTAATTG GCAAAAGGGC AAAAATGATG CCTGGCAGTA GTGAAATAGT 
435551 GAGGTTTCTA CATTGCAGAG GTGTTGCTAA TTGGCACAGC CACTCTAAAG 
435601 AAAGATTAGA CGATATGGAC CTAAATTCAT TAAAATGTGT ATACCGTTGA 
435651 AGCAGCATTT CTAGTAGTTA GCATAAGAAA ATAATTTGTC AAATATTCAA 
435701 AGATGTTTAT ACACAAATGT TCATAGTAAC TTTTTAAAAT ATTCAAAATA 
435751 AAAAAAACCC TCAGTGGACA CCAATGAATA AGTTTTGAAT ATCCTTTCAA 
435801 AAGACTATGA GATAGCCATG AAAATCAATG ATACAGATGC AAGTGTATCT 
435851 ATATAAATAA ATCTTTAATA AATATACAGT GTTAAGTAAA 2^ACACCATAC 
435901 TATAAGACCA TATGTATGTA CGTATGTGTA CATATATACA AATATACATA 
435951 CATATCTATG TAACTTGATA AAATTTGGAC GATTATAAAT GAACATATTA 
436001 GCAATATGCT CCCTGAGATG ATCGTTTTTT CTCTCTCTAG TGCTCTATAT 
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436051 
436101 
436151 
436201 
436251 
436301 
436351 
436401 
436451 
436501 
436551 
436601 
436651 
436701 
436751 
436801 
436851 
436901 
436951 
437001 
437051 
437101 
437151 
437201 
437251 
437301 
437351 
437401 
437451 
437501 
437551 
437601 
437651 
437701 
437751 
437801 
437851 
437901 
437951 
438001 
438051 
438101 
438151 
438201 
438251 
438301 
438351 
438401 
438451 
438501 
438551 
438601 
438651 
438701 
438751 
438801 
438851 
438901 
438951 
439001 
439051 



TTTATAATTC 
AAATGAAAAA 
TTCAAAACTT 
AAAAGCTCAG 
TTCTGTCCAG 
TAGGACAAGC 
GAGTTAATAG 
CATTTAAATG 
ATTCTTCCCA 
CACATCCAGA 
GGTTTACCAA 
AGCTGACTGT 
ATGTGGTGTA 
TGTGCCCCCA 
GGGAGGGATG 
TCGATTGAAC 
GCTGTTGCAT 
TTATGGCTGG 
AACTGCTTAA 
GACCAAATGT 
TTAAAGGGGG 
ACGGCTGCTG 
GCAAGTAAGA 
GAGTCCCCTC 
GGCCACGTTC 
TGAGGGAGGA 
GCTAAAGGAA 
GACTCTGTTA 
AATCTTTATG 
GTCCAGTGTC 
ATTTATCATT 
TGTACCATGC 
ACCATTATTG 
TAGTGTCTGG 
GTAGCTGTTG 
TTC TAATTAT 
GCTTTTTATT 
AATATCCCAT 
ATATCTTCTT 
CGGGTTAAAT 
ATGCTTTCTT 
TGTATCCACT 
ATAAGAAATC 
ACCATTTCTC 
AAATTACAAT 
ATATATATTT 
TTGTTTTCCT 
GGTTTTTTTT 
ACCCAGGCTG 
TCCTGGGTTC 
ACAGGCTCCC 
CAGAGTTTCA 
TCCACCTGCC 
GTGCCCGGCC 
CATTGTGATT 
TTCCTTCAGG 
TGCTCTCTCA 
GAATTAGCTA 
TTTAGAAAAC 
CATTGCTTCT 
GGAAATCAAT 



ACTTCTAAAA 
TTCCTTTACT 
ACTGGTCGTG 
AAAGACAACT 
TTCAGGATAA 
AAAGTCTGAT 
CAATTCTCTT 
AGAAGACAAA 
TCTCAAGCTT 
TTCCAAAGAG 
TGAGATAGCC 
CCACCTCTGA 
GATTACAAAA 
ACCTCTCTGC 
AAGATGCTTC 
TATCTGGAAC 
TGGTATCTAG 
CCAGTTATTC 
ATGCCCAGAG 
GACCAAGATG 
TGAGAAAACC 
AGCAGAGAGT 
TGCTGGAGTT 
TAAAAGCATG 
CCCTGCTCTA 
GTTAACAGGA 
GAGACATGGT 
ATGATCTGAG 
ATCTGAGAAT 
AGGAAAAAGA 
TAAACTAAAT 
ACCTCTTAAG 
GGACTAAAAA 
TATATATGCA 
TCATTTATTC 
GTTTCTTTAG 
CCACTCCATT 
AATTTGTATT 
CTATCCCTTA 
ATTTTTTCTC 
ATTATGTCAT 
TGGCTAATAT 
TAATTCTGTG 
AACTAATGGT 
ACTATGGACT 
GTATTTATCA 
TTTCTTTTGT 
TTTTT"TTTTT 
GAGTGTAGTG 
ATGCCATTCT 
ACCACCATGC 
CCGTGTTAGC 
TGGGCCTCCC 
AGGGGGTTTT 
ACTTTTGCTG 
CTGGCTCTTA 
CTTATTCCAG 
TTTCTCCATA 
AAAATCTGGG 
AAGTCTTTTT 
TACTTTGCAT 



TTTTAAAAGA 

TTAGACAAAT 

ACCTCTTAAG 

GGAACTAGAA 

TACTAATAAA 

TTGGAAAGGT 

CCCCCTCACC 

TATATTATTA 

GGTTTAAACC 

AGGCCTGTGG 

AAGAGCCAGT 

TCTATAGAAG 

GAGTTTTAGA 

CCTTCTGGTC 

TCCCTGATTT 

ATTTGATATG 

TTGATGCCTG 

TCATTCAGGA 

TTTATCAGCA 

GGAGGTGGGG 

TCAACAGAAA 

GAGGAGCTGG 

TCCGACAAAC 

CCTCCTTCCG 

CCCCGCCTCT 

GAAAATGAGG 

TTCACTTTAA 
AGTACTTTTG 

GACCAGAAAA 
CTAGCGAAAG 
CATACATATC 
AATAGGGACA 
AAAATCAATT 
AAATTTCCCA 
ACTTATGTAT 
TCTCTTTTGG 
TTTTAAAAAT 
TGGTTGATTG 
TATTTTCTGT 
TGTCTTTTTT 
ATCAGATTGT 
TGTTACTTTA 
GTTGGGGAAT 
TTTAGCACTC 
TGAAAAATGG 
GAGAGTTTTT 
TTGAGCATAA 
TTTTTTTTGG 
GTGCGATCTC 
CCTGCCTCAG 
CCAGCTAATT 
CAGGATGGTC 
AAAGTGCTGG 
AATTTATATC 
CTAAAATTAT 
CATCCATTTT 
GATCTCCTTA 
TAGCTCTGGT 
TGCTAGATGT 
AGTGGGCAGA 
TGATATTTTC 



ACACATTATT 
GTAACAAGAA 
AGTTTATTTC 
ATATAGGCAG 
CTAAAATAAT 
CAGTTTATGA 
CCATTCCATT 
TTACGAAAGA 
CCAAGAGAAT 
GTGCCAGATT 
TTTACAACCC 
GGATTTGACA 
AAAGATTTGT 
CCCACCCCTC 
CAAGTCTAAA 
TGTCTCCCTA 
AGAGATCAAA 
CAAAGATTTG 
TATAGTTTAT 
AAGGAGATTC 
GAGACCTGTG 
AAGAGGTCAT 
CCACACATTT 
AGACAGCACC 
ATCCTGAAGA 
AGTTTTTCCT 
ACTAAGTTGA 
GGGACTAAAG 
CTCACAGGGC 
ACTCTCTCTC 
AAGACACTTC 
ATTGCTTTCA 
ATCAATAATT 
AATGTTGCAC 
TTAATTAGTA 
TCTAGAATAT 
CCAGGATATT 
TTTCATCTTG 
AAGCTTAAAT 
GCAAGAATAG 
TCTGATATTA 
CAGTTTTTCC 
TGTGAGTACT 
ATTAATGATA 
TAATTTTATA 
CTCTAAAAGA 
CTGTGTAACT 
AGAAGGAGTC 
GGCTCACTGC 
CCTCCCGAAT 
TTTTGTATTT 
TCGATCTCCT 
GATTACAGGC 
AATGGGTTTC 
CCCAAATTTG 
TGAGAGGCCT 
CCTTACAAAC 
TCTTTTCATG 
GCTTACTGCT 
ACTAGAAGAT 
ATTAAAAATT 



AAATTTCTGA 
ATTATACTAT 
AGTGTCTTTG 
GGGTTTTTTA 
TCCAACGTTA 
AATGAGAGTC 
ATAATTTGAT 
TGTTATTAGG 
AGCTGCCAAC 
ACCAAAGCAA 
TTAAACCCTT 
GATTGAGAGA 
AAGGATACTG 
CCTACCATGT 
AATGGGAGTC 
GAGCTAGGGT 
AGGCTTGAGA 
CTGCAAGGAA 
GTTTTTTAGG 
TGGAGTTGTT 
TTGAATGCCT 
TTGCCAGCTC 
GACCCACTGT 
ACTCACTGCA 
AACCTGTGAA 
CAAGGCCAGA 
GCATTTGGAT 
GGACTGGAGA 
CTGCCTGCGT 
CTTTATTTGT 
ACTTCTAAAA 
GGACCACACT 
TCCAATCATC 
CAGTGTCTTT 
TCTAATCTGG 
TCCATTCGTT 
CTTTTTATAG 
TTGCAATTAA 
AAGTTAGCTT 
ATAATTGGGA 
GATATCTAAG 
TTTGTAGTTC 
CTGCTCCCGA 
CATTCTTAAT 
AGTACTATAT 
AGTTTCCTTT 
ATGGACTTAG 
TTGCTCTGTC 
AAGCTCTGCG 
AGCTGGGACT 
TTAGTAGAGA 
GACCTTGTGA 
GTGAGCCACT 
AATTAATTAC 
ACAAGGAATG 
AATAAATACT 
CTCAGACATA 
GGCAATCATA 
ATTGGGATGT 
ATATTAAAAT 
TAACATTATA 
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439101 GATTTGTTTT CTCCTAACTT TATTTATTTC ATATTGGTTT TTCTTTCTTT 
439151 TTTTTTTTTT TTTTTTTTTT GACAGAGTCT CGCTCTATCA CCCAGGCTGG 
439201 AGTGCAGTGG CACGATCTCG GTTCACTGCA ACCTCCACCT CCCAGATTCA 
439251 AGTGATTCTC CTGCCTTAGC CTCTCAAGTA GCTGGGATTA TAGGCATGCA 
439301 CCATCACATC CAGCAAATTT TTTGTATTTT TTTAGTAGAG ACAGGGTTTC 
439351 TCCATGTTGG CCAGGCAGTC TCAAACTCTT GACCTCAGGT GATCTGCCTG 
439401 CCTTGGCCTC CCAAAGTGCT CGGATTACAG GCATGAGCCA CCGCACCTGG 
439451 CCAGCTGGTT TTTCTTTTTA TTACACTGAA GATCTAACTT TTTGTGGATT 
439501 TTATTTTATT TTTTATTTAT TATTATTTTT TGAGATGGAG TCTCATGTGT 
439551 CACCCAGACT AGAGTGCAAT GGTGGGACCT TGGCTCACTG CAACCTCCGC 
439601 CTCCTGAGTT GAAGCAATTC TCCTGCCTCA GCCTCCCGAT TAGCTGGATT 
439651 ACAGGTGCAC ACCACCATGT CTGGCTAATT TTTTTGTATT TTTAGTAGAG 
439701 ATGGGGTTTC CTTATGTTAG CCAGACTGGT CTTGAACTCC TGACCTCAAG 
439751 TGATCCGCTG GCCTTAGCCT CACAAAGTGC TAGGATTACA GTCATGAGCT 
439801 GCCAAGCCCG GCCTTTTTGT GGATTTTAAT TGTCTTAATG GGTTACAATT 
439851 TGCTTTATCC TAATTGTTTA TATGTGTGTA TTACTATTAA ATTTACTGAG 
439901 TGAAATTTAG ATACAACATC TTCTTGGAAT ACAAGCTTTG CAGATAGATT 
439951 TAAAAATATA CCCCACTAAA ATGTAGAGGT AATACTGTCT TCAACAGATA 
440001 TTTGAAACAA TTCTTTAATC TGCATGGTCT TATCAATTAA GTTCATTTGG 
440051 TAAACCAAAT GAATTGATAA AACCATGCAG ATTTTGGCAC TTCTGCCTGT 
440101 GTTTTATCAT CTGCTATCTT TTTCTTTTAG GTAAGATTTT GTTTCTCAAA 
440151 TATGAAAAGC TTTTACATAG TTCAAAAGTT AAATGAATAT AAAAAGTATA 
440201 TTCAGAGAGG TCTCATTCCT ATCCTGTCCT TTCTACCCAA GTCCCTATCC 
440251 TTATACAGAG CCATTTTTTT AAGTTTTTTT TTATTTGTCT TTTCCTATTA 
440301 CCCTATCATT AATTGGTGAA TGAGGCAGAG GAGTTATTTG CCTGAACACA 
440351 CATGGCCACA GAGTGGAGGA GATAGGATTT AACCCAGAGC AGTCTGGTTG 
440401 CAGAAGTAAA CCCTTAGCTA CTTTGCAAAC GGCCTATTAA AGTAAGACTG 
440451 TGGGGGAAGA CAGGCTTTGA AATTAGCAGG TTTTATCACT TGCTATCTGG 
440501 GTGACTTTGA GCAAATTCCT TAACTTCTCT GAGGTTTATT TTACCCACTT 
440551 ATGAATTAGG GATAATAATG TTTACTTTGC AGAGCGTAAC AAAGAATAGA 
440601 GATAACTTAA GCAAATTAAG TTGTAGATTG TCAGGCACAT AGTAATTACT 
440651 CAGTGAATGT TACCCATTAT TATAACTAGT AAAACTATAA TAGTACGACT 
440701 CATACAGAAG AACACTAGAA ACTTTTATTG TTCAGTTGTC ACTGAAGTAT 
440751 GTGTTGATCT GATCTATGTT ATTTAGTAAG ATTAAAATTT TCTAAGAGTT 
440801 GATGAACTTG AACAAAAAGC AAACAAGCCT GGGTTTGATA ATTTATGGTT 
440851 TCAAAAATGT TAAAATAATC TCAGAGGGCT TTGCTGATTT TGGTTCATTT 
440901 TTCAAAATAG AAGTTTCAAT ATTTTAGTGG AAAAGAATGT TTTACTACTA 
440951 CTAGAAGTCA TATAGAAACA ATAAGGTAAT TTCTAGCTGA AGTGAGTTTT 
441001 ACGTAAACTT GTTTAACTTC CTCATTGTTT TTTTTTTCTG TTTGCAATAT 
441051 AAGATATTCT TATGCTCCTC ATGTTTTAAA GAATTTTTAT TCTGTGTTGA 
441101 ACACAAAGAA GTTTGAACCA TGCAGAGGTT GAATTTAATT TGTATAAAAT 
441151 CCTCTTAATA GCATGGTCTA ATTAAGACAC CCATTACATT CATGGAATGC 
441201 TAATTTAATT TGGAGGGAAA TGTAGTTGAG CACTATTGGG TTTACCATAA 
441251 AATTAACACA GTGGACATAA AATGGATGTG CTTATTAGAT ATTGTTATCA 
441301 GAAATATTAT ATGAATGAAA ATCACAATAC TGCTATCTTG ATGGAAATAA 
441351 TTGTTTTACA GAATAAGGCA TACTGTGTGA TTAACTAGCT AATTTATATG 
441401 AAGCTATATG AGCAAGACTG TTAGTGACCG CTTTTCTGAA GACAAAGATT 
441451 ATTCAATCGT TTTTTTCTGA CATTAGTAAA TGCCTTGGCC CCTGCCTTAG 
441501 AGAGTTTGTT ATATTGGTGA AATTTATCAT GGTAGATTGA ACACATTGAC 
441551 TTGTATTAAG GCAGCAGATC TCAAACTTTT TATCTCAAAA TCTTTATACC 
441601 CTTAAAAATT GTTCAGGACT CCAAAGACTT TTTATTGATG TGAGTTATAT 
441651 CCATTGCTAC TTACATAAAA AATTAAAATG AAGACATTTT AAAAATATTT 
441701 TTAATTTATT TAAAATAGCA ACAATAAAAT CTCATATGTT AGCATAAGTA 
441761 GCATAGTTTT TTAAAACATG AGGATATTCC AAATGTTGAC AGACTTCATT 
441801 GCAAATTGTA AGAAAATTAC ATTTGTTAAT ATCACATTCT TGAACATCTA 
441851 TAATTACTGA GAAGCTTTCA ATGTGTTGGT GAAAGACACA CGTTTCAGAA 
441901 TTTGACTTTT TGCTTAAAAG CTGCCAGTTA TTTTCTTCAA AGTGACAGAT 
441951 TCACTCATTC ATTTTTTTAA GAAAGTGTCT GCTGAGTACT TGAGTCTAAC 
442001 CATAGTTTAT TAGTCATTCT TTCAATAAAA ATCATGTTAA TGTAAAAAGC 
442051 AGATAGTTCA TTTCACAAGT CAAACATTTG CACAAATATT TTTCCTTGAA 
442101 ATAATCACTG TGCTTTGATA TGCAGCAGAA CTGCTTTATA TAAACTTCCC 
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442151 ATTTTGTCAC AGAGAATATT AAAAAGATGT GCAACAAGTA GTTGATATGT 
442201 AAATAAACCT AATAATTTTT ATAGCTTCAA CAAGAACATT CTTATATAAC 
442251 ATTGGCATTT TCTCCTGCCT GCAGGCGAGT GGCCTTGAAA AATACCTTGA 
442301 TTGCTGGAAG AGTTCAGTGC CAAATATTCG GTGCCAAAGA CACTAGCAGT 
442351 TCTACCCAAA ATTCCTTTTG CATCATTGGC ACCAATGCCA ATACAGTGAA 
442401 AAAGTCAAAT AATGTCCTAG AATTATTAGG AAAACTGTTT AGACCTTGAG 
442451 GAACACTACA AAGTGTCTTG GAGATCCCCA GGGCTCTGTG GACTATACTC 
442501 TGGGAACGCT GTAGTAAGAT CCATGTTTGC TCAGATGTCC TTGAATCAGG 
442551 ACAGAGAAAG CATTTACAAA GAAAGGTGGC TCTGTTTCTG CTACGGATGC 
442601 AGACTACTCT CCATGCTCCC TGAGAATTTG GGGTACTATG AGCCTTTGTA 
442651 GGAAGATGCA AAACTTTTCT GGCCCCAAAT CCCAACAGTA TACCTCCTTA 
442701 GATGGTTTTT GATATAAGAC TTTTGTGTCA CAAACAGTCC ATAAAGAGGG 
442751 ATTCACTTTG AATATTTGGA TGGTGGAGGA AGTGAGGGTG GGTAGAATAT 
442801 GTAAAGAGGC TCAAATCACA AATCACACAA TGGTGGAACT TTTGCTATCG 
442851 CCACATACTT GGAAAGATGA CTGGAGATAC AAATATAGAA GGAACCATTT 
442901 GCAAACTAAC ATAGTACAAA TTGTACTTTT CTTAATTCTT TGGATTTTAT 
442951 TATTTACTTT TTGAAGAACA TTGTTTTACT TTGGCTAATT TGAACAAGAT 
443001 ATTTGTTACA AATGGGATGA GATTCTAATA KTTGTATTT TAAGATCCCT 
443051 AAATAAGCAA GGTTTTGTTT CAAATATTTA GCCTTGGAAT AAAACGTTAA 
443101 CACAGTATAA TATAAATAAT TATATTTCGT TAATGCTTTT TACCACATAG 
443151 GTGATGATTA AGGGATGAAA AGGGATTTTC ATTCAGGATA AGCTGTTTAC 
443201 AAGATTGAAG ACTTTAGGAG GATCATCTGA TCTGTTACCC TCACTATTGT 
443251 CCTCCAATCC ACACAGGAAT TGGTTGAGCA TAATTCAGAG CAGGCTGTCA 
443301 GGGGAACGAC TGGGGCCCAG AAGCTAATTA ATACAAATCA CACTTAAGAT 
443351 TTTATTTCAA GTGCTGTGCA GAAGCTCTTT AGTTTAATTA GGTCCTACTT 
443401 GTCAGTTTTT GTTTTTGTTT CAGTTGCTTT TGAGGACTTA GCCAAAAATT 
443451 CTTTGCCAAG GCTGATGTCA AGAAGGGTAT TTTCTAGGTT TTCTTCTAGA 
443501 ATTTTTATAG TTTGAGGTCT CACATTTAAA TCTTTAATCC ATCTTGAGTT 
443551 AATTTTTATA TGTGCTGACA GGTGGGGGTC TAGTTTTATT CTCTTGCATA 
443601 TGGCTAGCCT GCTATCCCAG CACCATTTAT TAAATAGGGA GTCCTTTTCA 
443651 CATTGCTCAT TTTTGTCAAC TTTGTCAAAG ATCAGATGGC TGTATGTGTG 
443701 CGGTCTTATT TCTAGGTTTT CCGTTCTGTT CCATTGGTCT GTATGTCTGT 
443751 TTTTGTACCA GTGCCATGCT GTTTTGGTTA CTGTAGCGCT GTAGTATAGT 
443801 TTGAAGTTGG GTAGGGTGAT GTCTCCAGCT TTGCTCTTTT TGCTTAGGAA 
443851 TACTTTGGCT ATTCAGGTTC CTTTCTGGTT CCATATAAAT TTTAGAATAG 
443901 TTTTTCTTAT TCTGAGAAAA ATCATATTGG TAGTTTGATA GGAATAATGT 
443951 TAAGTCTGTA AATTGCTTTG GGCAGTATGG CCATTTTAAT AATATTCTTC 
444001 TAATC AATGA . GCATGGAATG TTTTCCATTT ATTTGCATCA TTTCTGATTT 
444051 TTTTCAGCAG TGTTTTGTAG TTCTCCTTGT AGAGATCCTT CATCTCCTTG 
444101 GTTAGCTGTA TTCTTAGGTA TGTCCATTTT TTTGCAGCTA TTGTAAATGA 
444151 GATGTGTTCT TGATTTGAAT CTCAGTTTGA ACATTATTCG TCTAAAGAAA 
444201 TGCTACTGAT TTCTGCACAA CAAAAGAAAC TGTCAACAGA GTAAACAACC 
444251 TACAGAATAG GAGAAAATAT TTGCAAACCA TACATCTAAC GAAGGACTAA 
444301 AATCCAGAAT CTATAAGAAA CGTAGGCAAA AAACAAATAA TCCCATCAAA 
444351 AAAATGGGCA AAGGGCATGA ACAGACACTT CTCTAAAGAA GACATATAAG 
444401 TGGCCAACAA AAGTGAAGCA AAATGTTCAT CATCACTAAT TATCAGAGAA 
444451 ATGCAACTCA GAACCACAGT GTGATACCAT CTCCCACTAG TCAGAGTGAC 
444501 TTTTATTAAA AAGCCAAATA ATAACAGATG CTGGTAAGGC TGTGGGGAAA 
444551 AGGGAATGCT TATACACAGT GGTGGGAATG TAAATTAGTT CAGCCACTGT 
444601 GAACTAATTT GTAAACTCCA GTTTGGAGTT TTCTCAAAGA ACTTAAAACA 
444651 GAAGTACGGT TAGACCCAGA AATCCAATTA CTGTGTATAT ATCCAAAATA 
444701 ACATAAATTT TTGTACCAAA AAGGAACATG CACCACTTTG TTCATCGTAG 
444751 CACTATTTAC AATAGCAAAA CCATAGGAGC AAGTTAGGCG CCCATGAGTG 
444801 GATTGGATAA TGAAAATATG GTAAATATAT TACCA'^GGAA TACTACACAG 
444851 CCATAAAAAG TAATAAAATC ATGCTCTTTG AAGCAACATG GATGTGCTGG 
444901 AGACCCTAAT CCTAAACAAT TTAGTGTAGG AACACAAAAC CAAATACTGC 
444951 GTGTTCTCAC TTATAGTGTA TATTAGTCTG TTCTCACACT GCTAATAAAG 
445001 ACATACATAA GACTGGGTAA TTTATAAAGG AAAGAGGTTT AATTGACTCA 
445051 CAGTTCCACA TGGCTGCAGA GGCCTCACAA TCATAGCTGA AGGTGAATGA 
445101 GGAGCAAAGT CATGTCTTAC ATGGCGGCAG GCAAGAGAGT GTGTGCAGAG 
445151 AAACGCTCCT TTATAAAACC ATCAGATCTC ATGACACTTA CTCCCCATCG 
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445201 TGAAAACAGC ATGGGAAAGA TCTGTCCCCA TGATTCAGTT ACCTTCCACT 
445251 GGGTCCCTCC CATGACATGT GGGAATTATG GGAGCTAGTC AAGATAAGAT 
4453 01 TTGGGTGGGG ACACAGCCAA ACTATATCAC AGTAGAAGCT AAACATTGGG 
445351 TACTCATGGA CATAAACATG GGAACAATAG ACATTAGAGA CACCTAGAGT 
445401 AGGGAGAGAG GGAGGGGGAC ATGGGTTGAA AAACTACTTA TTGAGTACTG 
445451 TGCTTAGCAC CTGGGTGACA GTTTCCATAC CCCAAACCTC AGCATCATGC 
445501 AATATACGCA TTTAACAAAC CTGCACATGT ACCCCTTTAT CTAAAATAAA 
445551 AGTTGAAATT ATATTAAAAA TTATTTCATA AAACAAGCAA GTTGTCAATT 
445601 TGCAAAGAAT ATTTCAAGTT TAAGGCCTTT ATTCTGTACT AGGATCTTCA 
445651 AAAGATACTA GACAATTTAT TTTTTCTTTT ATCAAGTTAT TACCTTAACA 
445701 TAGATTGTTT TCTTAGTTGT GAAATAATTG TAAATTCCAT GTATTTCTTT 
445751 TGGGGGATAA ATGAAATTTC AAATGGATTT CTATTGGAAT CAGTTGACAT 
445801 CTGGAATTAA AATCTTGCTG AAGTTTCTGT GATTCCATCT AAGATACTAG 
445851 ATTTTCTTTG ATTTTTTTTG TTCTTCACTG TTTCTAAAGC AATTATGGTA 
445901 GAAACAAGAC TAAATGATAG AAGATAAATG AGACAAATAG GATGTAGTTA 
445951 CCAGAACATT GGATTGTGCA AAATGGAACA TTTCAGCTGC TATTTCTATC 
446001 ACTAGAACAT TATGACTTCA TGTGGATTTT AGAAAAGTAA TAGGTAAACG 
446051 ATTCCCTAGC TCAAGAATCT CTGAGGACAG CTGAAGGCCT GGAGACCAAC 
446101 AATAACTAGA GGGATTCAGA TGGGTGCTGG GCTTGCTTGC TGAAGCGTGC 
446151 TTGGCTGTGT CCTGGAACGC TGAGTAGAGT CTGAAGCTGG TCAGAGAAGG 
446201 TCATACCCTA TTTGGCTTTT GCTGTTGTAA TCTTAGAATC ATGTTTTCCC 
446251 TTATTCTAAC AGCTTATCTT CCTTGCCCAT TTTATTAGAT GCCAAACTCA' 
446301 TATAATTTTT TTAACCTCTA AATATCTACT TGTATAGGAC AAAAATAAAC 
446351 TTTCAGCAGC CATAGTGTAT GAAGGAGTAT CTGTGATTCT TCCATAGTGT 
446401 AGATAAAGAG AACTAGAGAG GATACACCTC GTCATGCCTC AGGTCAAGTC 
446451 AGCCTCCCCT AAGGCTCTAG CGGCCATCAG ATTGTAGTGG TTATTTCCTG 
446501 TTCCCTCCTG TCTCTGTCCT CCCTCCCCTT CATCAGGGCA GCCTGCTGTT 
446551 CCTCGACATC ATGGATGTTG ATGGTAGCCC CAGCTTTGAT TGTCTTTCTA 
446601 CCAGTCAACC ACGCAAAGAA GGAGGGAGAA AACTCTTTTG GAGAATTCCT 
446651 TGGCCCAATG GACCAGGCAT CTCTTTTCCT .CAGCTAAGCT TTCCTTCTTG 
446701 GTATGTTACA GGGAAAGGGG AAGCAGTTTG CCAGACAGCA AGTCAACCCG 
446751 GTCTTGGAAT GAGGGGAAGG AGGTTTCCCT CGGTCCCAGT AACACTCTAG 
446801 CAGGGCCAAG CATTGAAGGA GTCTGCATTC AACTTGCAGA ATCTGTTTAG 
446851 CACTTTCTAC ACCTCCATCA GGATGTATTT TATTCTCTGG TTTTAGTTTC 
446901 CTATCTGGTT TGAAGAGGAG AAAAATAAAA GTCATGAAAC ATATAGCAAG 
446951 CACCTGCTCC CTTTGACATA AGTCAGCTTG TTCAATTCTC ACAACAATCC 
447001 TATAAAAGAG CTGCTATTAT TCCCATTTAA TAACAATAAA ACGGTGGCTC 
447051 AGAGAAGGTA AACATTTTTC TAAGTTCTTA TACTTAATAA ATCTAAATCC 
447101 AAATTTGTTT TTCATTCTTC TTTCCATATC ACACTGCTAT ATAGTTACTA 
447151 TGTTATCATC TCACAATCAC TGATCAACTC CGACCTTTGT CTTTTTTCCT 
447201 GATAAGTCAC CTTGTTCTTG GCATTGCTTT CCATATTCGT GGCTAGCAGT 
447251 GTCCTTCCCA GGGTTAGAGG CAATGATGAG TTCTTTTTCT TGTGATTGCT 
447301 CAGTAGAAAA GGCACGAATG GTGCTGCAAA CCTAAGACAC CTCCAAACGT 
447351 TATACACTTT TGCTAATAGC AGGAAAAATG ACCTCTGGAT GCTGCCAAAG 
447401 TGAGCACTGC CTTTACGTAT GATAGAAAAT ACAATTATAC ATTTTAAAAT 
447451 GATCTCACAG TCTTGCTTTC TAACTTGAAT TCACTGTCAT TTAACTGATT 
447501 TATAAGGATT CTTGGCCACA ACTAAAGTAT ATTTTAAACT TATAAGTATT 
447551 CTCTAATGAG GAACTTTTAA AAGTCCAGAT AAAAGCCCAC TATTCAGTAG 
447601 TTAGAATAAT TCAGTGAAAT TAACCATGTA AATGCTCAAT GAAGCAAATG 
447651 ATTAAAGAAG GAATGAATAA AAATAAAGAT TAAAGTGGAT CAAATTATAT 
447701 TTTGTCTTTT TTTATACTGC TTATAGATCA TAATTAAATA GTGGTGGCTC 
447751 AAAAGCACAA AACCAACCTC ATCACTTACC AGGTGTGAAA ATAATGTTGA 
447801 TGTGTATGCT TAGTTTTAGC GTGACAAAGC AGATACACTT TTTCACTTTT 
447851 AAAAAAAATC ACTCAATGCA TCATTCTTTA ATGTTTGCTT TTGTGATGAT 
447901 CACTGAATAT AGAAAGTCTT TTGATTTTAG AATAATTTTA AGTCTTTGCC 
447951 TGCTGATAAC CTCAGACACA GACCTGCAGC ' CAGAATTGGT TGAGAGTTCT 
448001 GCTATCTTCT CCAATGAGGG ATATTTGAGT AT AT ACT AAA TAATCATATT 
448051 TATATACTGG CATTTAAAAA ACCAAATATT CTCATAGTTC TCTTCAACCT 
448101 CCTGAGCTCC TTAATATAGA TAATATAGAT GGCCACATTT TTGTCATTAT 
448151 ACAGTCATGA AAAGAAGCCT ATGTAGGTCT TATCCACATT TACAAGACAG 
448201 GAACTGAAGA TCAAAGAGGT TAAGTGACTT TCTTAAGTTC CTCAGATAAA 
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448251 CATTTTGAAA CAGCTGAGAT GAAATACAGG TGCGCTGACT TCAAAGCACA 
448301 AGTTCTTTTC TCTAGAGTGC ACTACCCCCA ATGAGTTGAC AAAATTAAAC 
448351 AAAAACTAAG CAGAATTATA TTTTCTTACA CATACTTCAT ACAAGTAATC 
448401 ATTAAGCAGC TACTTGGTAC TAAAGTTTCT CCCAGGAGGT TGCAGTGCAG 
448451 ATGGGTAAGT AAGCTGCTGT ATGTGCCCCC AGGCCATAGG CATGTCAGCA 
448501 GCTATGGATA CCACAGTAAA ATAAATGACT ATCGAGAAAG CAGGAGGAAC 
448551 AGCTCACCAG GGGAGCACAG TGATGAAGTG AAAGCTTCAC CTTTCAGGAA 
448601 TTCTACACAG CCCATAAAAA AAAAAACTCT ATCCCCTCCC TTTCAGAGTG 
448651 TTTCTTAACA ACTGCTTAAC ACAGAAATCA TCAACGTATT GTTGTAGATA 
448701 CTGGGATTAA TATGTGCAGA ACTTCAAACA GAACCCAGAT CTCATGACTC 
448751 CAAGTAGAGG ACCTGTTCCA ATAACCCATG AAAATAATCA CAACATCCTA 
448801 ATCATGTTTT CATCCCTGTT TTACTTTATT AGTTGTTAAT AGGTCCATTG 
448851 GTCACTGGGA CACCCACATG GGGGTTATAT ACCTGTTCAT ACACAAGCTA 
448901 TGAGTCTTTG AAGACAGAAG CATATGAGGT CCTGATTTTG CTCTAGTATG 
448951 AACTCAAACA AATTTTTTTT CAGACTAGTC ACCCTCATCT TTCAAAACAT 
449001 CAGATCATTT CAGAGCTGCT TCTAGGACAT GTGAACTTGG GCAGAGAGAA 
449051 TTGCTTTTTC TTCACTGTGT TACTGTAGTG TACCAGTCAT ATGGCTATTC 
449101 TAAGATCTTT AATCCTCACT TCCTTGTCAC CCTCATGAAG GGTCCTTCAA 
449151 ACTTCTTGAT GACAGCCAGG GGCGAGGGAG GGCACTCCAG GCAGCAAGCT 
449201 GACACAATTT ACAAAAGCAG TTATCTTCTC TCCCGTGAAG CAATGCCGCC 
449251 TGGTAGAACA AATGGTGCAA CCCATGCACA CTTAGTTTAT CCACCTTGAT 
449301 AATCTAAGTG TCAAGGACAG AGCAAGTACG ATTCATAAGA CTCTCTGGAG 
449351 CATGCAGGAT ACTGACAAGC TTTTTATTTA TGACCCAGGG TTTAACCCAT 
449401 GTGAGCTCAT ACTTTGTCTT CTGCTAAGCC TTCCATAGCT CCATCTCAAG 
449451 TCCTCCTTTA AGGTGAGGGT CACTGGACTG CATATCATTC AGTTCCCAGC 
449501 TAGTTTTCTC TTCATTTATT TGTGATTTTA CTCAGATTAT GATATTATTA 
449551 GTTTAGAAGG GGTTGCTGTA GTCAGGACAA TCAGTAATGC ACTTCTAAAT 
449601 ACTAGGTAGA TCCTCCATTC TTTGTGGTTA AACTCCTTCC AGGTTCTAGT 
449651 TTGTTTGTTT ATTTATTTAT TGGAAGATTC ATACACTTTG CCTTTCTTTT 
449701 CACCACCTTG GTTTGGTTTA CAAGAACCAA AGCTAGTTTC TCTAAACCCA 
449751 CTTGATTAGT GTGTGTTTGT GTGTGTGTGT ATGTGTGTGT GTGTATGTGT 
449801 GTATATATAT ATGTGTGTGT ATATATATAT ATATATTTTT TTTTTTTTTT 
449851 TTGAGACAGA GTCTTGCGTT GTCATCCAGG CTGGAGTGCA GTGGCATGAT 
449901 CTCGGCTCAC TGCAACCTCC ACCTCCCAGG TTCCAGCGAT TCTCCTGCCT 
449951 CAGCCTCCCA AGTAGCTGGG ATTACAGGTG CCCGCCACCA TGCCCAGCTA 
450001 TTTTTTTTTT TTTTTGTATT TTTAGTAGAG ATGGGGTTTT ACTATGTTGG 
450051 CCAGGCTGGT CTCAAACTCC TGACTTCGTG AACTGCCTGC CTCGGCCTCC 
450101 CAAAGTGCTG GGATTACAGG TGTGAGCCAG CGCGCCTGGC TTGATTAATA 
450151 TATTTTATTT CTACTACATC AGACACAGAT TCTTTTTGCC CAACATACAT 
450201 AACCATGTAG TCCTACCTAA TATAGTTTCC TCTGTGTGTG TCTGTGTGTG 
450251 TGTGTGTGTG TGTGTCTGTG TGTGTGTGTA TTATATATAC AACTAGTAGG ' 
450301 AAACCCCACA TATCCACATA TCTTTCTAGA TATCTTATCT CCCTATTACA 
450351 AAACTCCTCT CTCTTGATTA TTACACTTTA TAATTTTACT CCCTATGTCA 
450401 TCCAAAGTGG GACTCTCATT GCTGATGGTC TAAGTCAGTG AGACCCTGTT 
450451 GCCTTGGCCA TATCAGAGTC TCTGGATGCA GGGCCCTGGA ATCACCATTT 
450501 TTTAAAAGCT ATGCAGTGAT GTTACTGTGA AGCCAGGTTC ACATTTTCAG 
450551 CCTTTAGAAA AGGCCTGTTG ACCTAAGTTG GAGAATGCTA GCCTTGTCCC 
450601 CAAAATTGCC TGTCTTCTGT AGTTATTCAG TCAATATTTA TTGGCAACAA 
450651 TCATGCAATA AGCAACTCAA ATGAAATATC CTGGCTAATG TGGGACCACT 
450701 CTGGCACCTC TGAAGTTAGA GGACGTGCTA CTAACATTTA TCATTTCAGA 
450751 TCTTTCTGAT CGAGTGGGTA AAAAAAGATA CTGGACCATT TTAGTTGCTA 
450801 TTTCACTGAT TTTCTGAGAC TAGGCAAACA AATACACAAA GTTTTCTCCC 
450851 TCCTTCTGCT GTTCAGTCAT CACAAAGAAA AATTAAGGCC TTTGTATATA 
450901 CTGAGCAAAC TGTTGGGGTA AAGTTATAAA AATGGCAGAC AGATCTTACT 
450951 TTGCCAAATT AAGACATTTA GCAAGTATTG TAAGTGTGAT GAGTGACGGG 
451001 AATAAGTGCG GGACACACTT GCTTTGTCTG ACAGATGTAC TAACCTGATT 
451051 TTGATCTGCC AGAGTACAGG ATTCATTCCT TCCAGGTGCA TTATTCCAAA 
451101 GTGCTTTTGG CTAAAAGGCC TCTTAAAACC TCAATTTATT GTTCTCTATA 
451151 TAGTATATTT CTATTTGCCA TCTCTCCATT ATTTTCACAG CTATCTTGTG 
451201 TCCTAAGTTG GGCCCTAAGA AAAACAAACT GTCCTAAACC ATAGGCCAAG 
451251 ATACCTGAAG AACTGAGGTG GTAGGGCATG GATTTTGGGA AAACTTTTTT 
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451301 ATGGAAATCT CTGTCTTGGA ATTTCTCTCA ATTAGCAGTT CTTGTACTGT 
451351 TTACAGCACG AGAACTGAAA GCATAACCTG CTTCTCTTAT GAGGTAAACT 
451401 GCATATTCTG GGTGAATGTT TTCTCTCCAC AAAAGCCATT TCACATGTTT 
451451 CCAGATCAAT TTTGAAGTTG TTTTGACACT TTACACACTT TATTTTAACC 
451501 TTTTGTTTCT GAGTGTAGTA TCCTCACCTG TCTACAGCAT GAGGAATTGA 
451551 AGGCTCTACC CTGTCTGTTT GCCTGGCAGA GCTGGGGTGA CTCTGCTGCT 
451601 TACACACCAG CCTTCCCACT AGACTGAAGC TTGCCAAGGC AAGGGCTCTG 
451651 AATGTGGAAG TGTTTCCCTT TCCAGGGCCC TAAATATTTC TAACTTGTAT 
451701 GGCACCTGGG AACAAATCAA ATTATCTTCT TGACTCCAAA TGAACCTCCA 
451751 GGCCTTCTTA GAAAAGCTGA AAGAGGAATT AAAGGGGGAT AATGGCTCAT 
451801 GCCTGTAATC TCAGCACTTT GGGAGGCAAG GCAGGCGGAT CATGAGGTCA 
451851 GGAGTTTGAG ACCAGCCTGA CCAACATGGT GAAACCCTGT CTCTACTAAA 
451901 AATACAAAAA CTAGCTGGGC GTGGTGGCAT GCACCTGTAA TCCCAGCTAC 
451951 TCAGGAGGCC GAGGCAGGAG AATTGCTTGA ACCCAGGAGG CGGAGGTTGC 
452001 AGTGAGCTGA GATTGTGTCA CTGCACTCCA GCCTGGGTGA CAGAGCAAGC 
452051 CTCAGTCTCA AATAAATAAA TAAATAAATA AATAAATAAA TAAATAAAAA 
452101 GGGGGAGAGG GGATCAAAAG CACTAGTCTG CATCCATTCA GAGCTATGTG 
452151 TGCAATGGGG AGTAAAGCCA GCCTAGAACC TGTGCATCAT CCCAGGTGTG 
452201 TTTCCAGTCC TCCCTTCCCA CCATGAGACA AAATGAATGG AGACAAATTT 
452251 AGATCCTCTA AAATGAGAAC AGGAGGCCAA AATGGTGGGA ATATATAAAC 
452301 GACAAGGGCA CATATCACTT GCTTTACTTT AGTCCTGAGA TGCCCAAGGC 
452351 CAGACTCCAT GGATGAGTCT TGGGGCACTT TTGGTCCCAC TCTCTTAGCC 
452401 GTCTCCCAGT TCCTGCAATT CAGAGGAACT CCCCAAATTA ATTAATCTGA 
452451 TGAACTCTGT TCTCGAGATG ACAGTATCTC CATATTATCT GTTCCCAAGT 
452501 TTCCTTTCAA AAGGGACAAA TGAACAAGGA GGAGACAGCA CTAAGTCCTA 
452551 CTCATTTCCA AGTAAAGCAT ATATATGCTA TTTCCCAGCT TGGGGGCAGG 
452601 CTGGGCACCC TTTCAGTATT TTATGTCTGC ATCTTTGAAT GAGTCTGGTT 
452651 GTTCTCTCAA GGGAGTTTTA CTTTATGGCT GAAAGGAAAC CTGAAGCTGA 
452701 TTTCTTGGCC TTTCTCTTTA ACTCATTCCT CTTTCTCCCA CCAAGGTGTC 
452751 TGTAAAGATT CTCTCTGGAA TTGCCTCGCT TCCTAAAAGA TATTTTAAAC 
452801 ACTAGGAGTA GTGAATGCCT ACTCATTTTA TCAAGGAGGA AACTGAAATT 
452851 CAAATCAGAG GAGGGCCTTG ACTAAAGTCC CATGGACAGT TAGTCACAGT 
452901 TCACACAATG GAAGTCCAAG CCCAGTTCAC TTTCTGCTAC ACTGCCTTGC 
452951 TTGCCATAAC ATCATCTTTC AATTAGCCGT GGTCGTTGCC TCCTATGTAT 
453001 ATTCAACAGA ATCCTGAAGA AAGCAAGCTG AACACGTTTC CAAATTAGCG 
453051 AGAGCAGCTC TGAATAAGAA GAAAATTCAT GGTAAGTTTA TGGGATGGCT 
453101 CCAGGTGTCC ATCTTTCTAT CTTCCCTTGT TATCACTTTC CCTGTTTTGG 
453151 GTGTAGGGGG AGAAAGAATT TCCCATTGGC TGTCTCCTGG GGTGAGCAAA 
453201 AACTGAATGT GTTTAATCTA GGTTTTCCTC AGATTCCACG ACTTGTAATA 
453251 AGCACATCTT TATACTCTTA CCTCTCCAGC TGTTCTCTGT GCAAAGCTGA 
453301 ACTAACTAAA ACTGCTGAAA ATACCAGAGC TATTTAGGGA ATTTCCCTAA 
453351 CTGCTTCAAC ATTCTTAAAG TGACTAATTT AATTGGATTT ATAGGGCACT - 
453401 ACATAATAAT GATATCCTTT GGCTATGTTC TTAGTGGGTT CTACACCTGT 
453451 AATGAAATCC ATAATGAAAA GGAAAACAGA AGTACATTTG GGATGCAAAA 
453501 CCACTTAATT TCCTGGACAG CCATTAAAAA TGTGGCATCC AGTGCAATAC 
453551 CTTAACTTTG ATTTTACCTT CAGACACCTA TGATGATGAT TCTGGTTGTT 
453601 GGTTGTGGGA CATTAGAATA ATTGTGTCAG AAGAATTCCC ATCGCATCCT 
453651 AGATTCACAC AGCATTTCTC TTACTGGAAA CCCAATTCAT TAGTCATCTG 
453701 AGTTGGAAAA CTGGGACTCT AACACTGGAT GTTGATTTTA TTTATTTAAT 
453751 CTTTTGTGTT CTGGTCTTAT TTCTGTTGTA TTTGGGGGGT- CCTGTCTGCT 
453801 ATCTCCTGTT CCTGGTGTTG AGGTGTAGTG TTCTATGGCA TTTCTGCATG 
453851 TCCTTGGACT ATTTTCCAAG GAATATTTGT TTAACAAACA TCCTTCAAAG 
453901 ATAGAGGCAG TATAACCATG AAATGTCTCC CTCCAGGGCA AGGTTAGACA 
453951 GGTTTGCTTG AAGCTCCCTT CATAGATTGT TTTCTAGCTT TGGGATTTCT 
454001 TGGTTTTGGC AAAATATCTA TCCTGTGAGT TGTATCACCC TGAGTCCACT 
454051 TCTACGTGAC TCCCACTCTC GTCCTCCCCA TGACACTTGA GTGACAAGGG 
454101 CAACTGATCA AAACAGAAAG CTCATGCTGC CTCCTGTGTT CTCCTTAATA 
454151 AAGTTCATTG TCTCTGATCC AGGAGTCTCA TGTCTTCTGA GATCATCTAT 
454201 GAAACTAGCA GGTTAACTTG TTGGCTAGCA AGTAGGGTTA AATCTCAGAC 
454251 CCTTCACAGT TCTTGACACC CAATTTCTTC TTTTTrGGAG CCTTAGCCTT 
454301 AGATCTAGAT GATTAAGTCC TGCTACATGA TCTTCATCCC AGTATTTCTC 
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454351 CCTTGCTTAA GTGGCTCAGC 
454401 TACCCACTAA ATGTGGTTCC 
454451 TTTGTAGTTC TCTAGAATGG 
454501 CATTCTAATT ATTTGATAGT 
454551 AGTCGTTGTA TCTAAATTCA 
454601 GTATAAGAAA TGGTCTAGAT 
454651 TCCAGAGTCA GGTGCACCCA 
454701 GAGTGGTACA TACATACAAA 
454751 TATCCATTTA AGTAGGTAAC 
454801 AAAATATGCC AAATTTTCTC 
454851 GCAAGTATGA TCCTTCTCAT 
454901 GTGGGCATTA GTGCTATAAA 
454951 GTGCTGGTAA TTTACAAAGT 
455001 AGTACCAGGA AAATCTGTAA 
455051 AAAACTGTGG TACTTGTTAT 
455101 ACCTAGGAGG AATTTTTTCT 
455151 ACACATTAGA AACACCAGAA 
455201 GCCACCCCAG ATTCTCACTT 
455251 TTTTTCTTTT TTTTTTGAGA 
455301 TGCAGTGGCG AGATCTCGGC 
455351 CCATTCTCCT. GCCTGATTCT 
455401 ACCATGCCCG GCTAATTTTT 
455451 ACTGTGTTAG CCAGGATGGT 
455501 CTCGGCCTCC CAAAGTGCTG 
455551 TCATACAATT AAAATATGAA 
455601 ATTTTAAACT GTCTAGAATT 
455651 AACTTTTATT TTCCCAAATA 
455701 TACTTTTATT TAGTAGTTTT 
455751 CTTATAAATT CCATATATCT 
455801 ACTTGGTTTA TTTGACAACT 
455851 TGCAGATTTA TGGAGTGTTA 
455901 TGCACTATTT TTTCAAAATA 
455951 AGAGAAAAAT GTAGGAGATT 
456001 GCCATAGGGA ACCAGTCAGT 
456051 TTTGATTGGT CACACCTGAG 
456101- AGAGCACACG ATGTGAGAAC 
456151 GTGTCAGAAC TATAGAGTGA 
456201 TGAAGGGCAG GGATGCCTGT 
456251 TATCCCTAAT TTTATTTTGC 
456301 TAATTTTAAA TGGTAGTGCT 
456351 GTGGTTACAA ATGAACTATT 
456401 AAGTGTGCTT TTCAATACTG 
456451 GTAAAATTTC CCTACATTAA 
456501 TTACATGAGA GAAAAGGAAT 
456551 TAGATGGGAG AATTCTATTA 
456601 GAATTGAATT GGAATTTCAA 
456651 AAAATCACCT TCTCATCATC 
456701 TACATAACTG CTTCAATAAA 
456751 TGTTTTTTCC TATCTCAGGG 
456801 AATTCCCTCA CCTTCTTTTT 
456851 CATTCAATAT TTTCCTGGTA 
456901 GAGTGACATT CAAGCTCCTT 
456951 TGGTCTATCT CCTAGTAAGC 
457001 CCTATTGTTT ATGGTTGTGT 
457051 GCCCTCCCAC TCAGGAGTAA 
457101 TTGACTTGTA CTTAGTCACA 
457151 ATGTGTTGTG TCTGGATCTG 
457201 AGTCTGCATT ACTCCAGGGT 
457251 TCATGGAACT TTTGCGTGAG 
457301 CAAGCTTTAC CCTCATCAAA 
457351 GTTGCAACCA CCCCTCATAC 



CCATATTTCC AGTACCAGTT CCCACTAAAT 
CTCCCTGTGG TCACAAGGAA ATAGAGAATT 
GCAGGACAGA CATATTTGAA TGTATTTGCC 
GACAGTCTAG GGCTTATGAT GGAGGGATTC 
GGGTGAAAGG ATTGCATGAT CAATAGGTAG 
ACTTAAAGTC CAAAACATCT GCATCAGGGC 
ATGCTAAATG ATATGAAGAA CTGGTAATTA 
GTGAGCTATT TTTGATGATA GTGAAGCCAC 
TACTCTGGTT ATCACAGAGG AAAATGTGGG 
CAATCCTTTC TGTAATTCCT CTGTAGACGT 
GAGGTTTAAG GGTGGAAAAG CACAACTAAC 
GCCACTGTTA ATTTAACAAG TTTATAGAAT 
TTAAAATGGT GACATTTCTG TCTATTTTCC 
GTCTCCTGAA CATTTAACTT TTTAAAAATA 
TAACATATTC TTTGCTTTGC CTGTTAGAAA 
ATTAACCAGA AACAGTTCTC AAACTCTGAT 
AAATCTTAAA ATTACTGATA CTTGGGTTTC 
AATTGGTATG TAATGGATCC TGAGATTTTT 
TGGAGTCTTG CTCTGTCACC CAGGCTGGAG 
TCACTGCAAC CTCCGCCTCC CGGGTTCACG 
CCCGAGTAGC TGGGACTACA GGCACCCGCC 
TTTCTGTATT TTTAGTAGAG ACGGGGTTTC 
CTCGATCTCC TGACCTCATG ATCGGCCCGC 
GGATGGATCC TGAGATTTTT TGAAAGTTCA 
GCAATGATTT GAAAAACACT ATTTTATAGG 
CAGTTTTGGA CTAGTATATA GTAGGGATCA 
GCCAACTTTT CCAATGTTAT TTATATCTTC 
TTTTTCATCA TTGATCTTTA AAATGATATC 
GAATCTGTTT CTGGATTCTC TATGTTTTCC 
TTTTAACCAG TATCACACCA TTTAAATTAC 
TAATAACTAG GAGAACAAGT TTTGCCTTAT 
TTATTGGCCA TGTTATACAT TTTATTTTTC 
TCGATCAAGT TTCATCGTCT GATGTTTTGG 
CTCTGTATTC CAGTTTGTGA AGTACGGTGT 
TCTTATGATC TAACCTTGGA GCACCGTCTG 
AAGAAAGGTG GTTTTGGGAG ATAGAATGTG 
GAAAAATTAG AGATGGGGAT CCTGTTACCT 
TTTGATGCCT GAAGCATCCA AACCCAGCAA 
TTTTATGGTA CCAAGAATGA GTCCAGGAGA 
CCAAAAAGTG ATACATGTAC GCTACATGAA 
GTTGTTCATG AACTCCACAT ACCCATCATT 
CATTACTGTG GCTGCCTTCA AGATTGTGCT 
CTTTTCACTT ATTCAGAGCA TCCCATTAGT 
TAGAAAGATA AGGAGTCTAA GAAACCTTAA 
AGTAAGCATA ATCAGGCAAG GAGTCTCTTG 
GGTCACAAGT CCTATCAATC CGCTCCCTCT 
CTAGCAGATG AGTCCTCCAT CCCCACATCC 
CTCTCATTAA GGATTCCATT GAAGATACCA 
CCTTGGTACA TACTCTTTTT ACTGCCTAAA 
TTCCTTTTGT CTAGCTAACT ATAACTCATA 
GTATTTTCTT AGAAATACTT AGGGAGCTTA 
CATAGTTTGA CTACCACCTA TGTATTGTTT 
CAGGCTAGTC ACCTAAATGC CCTCCTTTCT 
CACTXGCATA CATTACCTGC TTGTAGAGCA 
TAGGTTTTAT TCCTACGAAC TAACATGCAC 
TTTTGACTAG AATTACAGGG ACTTGAAGGT 
CCCATATGTT CAAGGTATTT CTTCCAAGTA 
AAGCGCCAGA CTTCCTCATT TTTCTACCTT 
TGGCAAGGAA TACTCAATGG CAGTGTTTCT 
ATGACCTGGA GACCTTGCTA AAACACGATT 
CCTGTACCCA CCAAGCCTAA ATCCATCTCA 
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457.401 GATGGAGCCC AAGAATTTGC ATTTCTTATA AGCTCCAGTG ATATTTAGGC 
457451 TGCCAGTGCT AAAATCACAC TTTGAAAACC ACTATTCCGT GACACTGTTT 
457501 TAAAAACTGG TTGCATGTTA AAATCTCCTA GAAACGGTTA ATTAAGATTT 
457551 TTTTTTTAAA GACTGTGCTT GGGCTCCCCT CTAGACAAAT CAAGATCTCT 
457601 GAGGATGCCA CTTTTCTCAT GAAAAGAAAA AAATTCAACA ATCCGCAACT 
457651 GTTAGGAGAT TTTAATATGC AGCTAGTGTT GAAAATCACT GTTCATACAA 
457701. ATGGCAGATG AGTAAGCAGA GTCATGAACA AGAGAGCAGG CTTGCTGGGA 
457751 GCCTAGGCCA GAGCAAATGT CATAACCTCC TGAGGCAGCG TGAGTTAGGG 
457801 CATGCATCTC ACATTTTAAT GTGCTTGCCC GGAAGATGAC GATAGTAAAT 
457851 CTGGGATGTC TCTGAAATGC TGCTGGTCTA ACAAGCTCCC AGATGATTCC 
457901 TATCCGTGAA CCCCACTCTG ATTAGCAAAG TTGGGATATC CATGAAGAAA 
457951 AAAGGGTCAG TGCGTGCAAG GGATGACAAA GGCAGGTGAG TGAAATGGGC 
458001 TTGAGAGCCC ACATCCGGTA ATCCTTTAGA ATTTCAAACA TAAGAAAGAA 
458051 GGGGAAGTGC TGGGGGGTGG GTTCCCCTGT GGAGGAAGTT ACTTCTAAGC 
458101 CATCAGTTTA AGACTTTAAG TGAATTTCTG GGGAAAGCAT CCAGTTTTGA 
458151 TAAGTCTGGC CCAGACTAAA TATATAATTT CTACCTAATT AAGGAGGCAG 
458201 GCACACAGGC AAAAAGGTAA ATGCAGCTTC TGGAGGCACA TCCTAACAGA 
458251 ACCCCAGAGA AATTTTATCA AAGTCTGTGG GCTTTCTCCA TCTTCTTAGC 
458301 TGCCTTCATT GAGAAGTAAA TTTCCATGGG AGAAAATTAG GGTTTTGCTT 
458351 CATTCCCAGT AGCTGGCATT AAATTAGTCC TGTAGAATGT GTTTGGAGAT 
458401 CATGTTGATG AGCAGAGATT GTCTAAGCAG CAGGTGTTAC GAATTCATTT 
458451 AATCATTTAA CCACACACCC TGCACATATT TCAGCTTTCT TAGGTTGACT 
458501 TTTATTTGCA AAACACCTAT CTGAAGGTGT GTTATAGTGG ATGTATAATT 
458551 GATGGTGCTT GTCGAATTTT ATGCTCTATA AATGCCAGAT ACATTTTCCC 
458601 CAAGTATGCT CTTTCTTGAT CCATTTCTCT AAAGTAAACA TTTGAAACAA 
458651 ATTTAACTCT TGAGGGATTC TGGCAGGACT CACTTTCTCA CTGATTTTAG 
458701 AGCTTGTGTT CAGGTTGGGG AGAGAGCATT ATAGTTTCCA GACTCAGCTC 
458751 AGTGTAAAAC CATCGTGCCC AGACAACTCA CTTAAAAGGA ACAGGAAAGT 
458801 GCAGTTAGAG AGCCTTTACT CCACCCATCT AGATGCCAGG CTACCACCAA 
458851 TGGGCATCTT GTTTTCCCTC TTTTCTCTCT CATTGGAAGA GAGAGAGAGA 
458901 GAGAGCGAAA GAGAGAGAGA GAGAGAGTGT GTGTGTGTGT GTGTGTGTGT 
458951 GTGTGTGTTT TACTACCTGT AGTAGTTTGA ATGGTGACCT GGTGACCCTC 
459001 AAAAAGGTAT GTCTGCATCT TGAAACCTGT GACTATGACC TTATTTGGAA 
459051 AAAGTGTCTT TTGCAGACGT AATTAGAGAT CTTAAGACAA GATCATCTTA 
459101 GATTATCCAG GTGGATTCTA ATTCCAAGGC TAACTGTCCT TATAAGACAC 
459151 AAAAGAGGAG AAGGCCACAT GATGCTGGAG GCAGAGATGA GAGTGATGCA 
459201 CCTGTAGCCC TAGAAGCCTC CAGACCTGGA AGAGGCAAGG GACGGAATTT 
459251 CCCTGGAGCC TCTGGAGAAG GAAACTTTGT GGAGCCTGTG GAAACCGATT 
4593 01 TTGGACTTCT ATCCTCCAGG ATTGTGACAG AATAAATTTC CATTTTTCTA 
459351 AAAGTCACCC AGTTTGTAGT AATTTGTTAT AGCAAACCCA AGAAAATGAA 
459401 TATACTATGC ATCAGGTTGG GGTCGGGAGG TGGGGCACAG AAAAAGAAAA 
459451 CAAGTTCCCA GGCCCTCCTA AAAGTCATTT TGTTTAACAT TTGATCCATG 
459501 CATTTGTCAT GCCCTCCAAA GGGGACTTCT GCAGAAAAGT TCTCAGTGTC 
459551 CTTCAATAAG ATAGAGCCTT TCTTTGCTCC TACCCTAACC TTGAATCAAA 
459601 TACAGCTTGG TGGGCCCTGA CGCTAGAGGA ATTCTCAGTG GCTGGGCTGG 
459651 CGGAGAGCTA TCTATTTGTG TATAGGGTTG CAGCCAAAGT CTTTAAACTA 
459701 AAACTAAAAC TCTGTGGATT AAATAAAATA CTGGACATGC AAAAGAAGAT 
459751 TCTCTTCGGG AATCCCTGTG TACTAGCCCC TCTGTGTAGT GAATGCTATC 
459801 ACTTTTTCTT CTCGCTTTTC TAGATAGGCA TCTAGAAAGG TGTTCAATCT 
459851 CTTGGGGGTA ATCCATAGGC CTCCAGGGTT ATTTCCAAAA GTCTTGTTGA 
459901 TGTTATCAGT GTTAACACAG AACCTCTTCT GATTAGTGTT CAATCTTACT 
459951 ATGGAATAGC TTTTTTCTCT CAAATATTAT TAGAAAATTA GTAAAACATA 
460001 AAAATGATCA GGTTTGGCCA GTTTTACCCC TACTGTCCTA CTGCTATTCA 
460051 AAGACCTAGC ACTTCTGTAC ATACGAGTAA ACTGCACTCG- AAATGGTATT 
460101 CTCTGTTTCT GCTTCTACTT AGGTTCCAAG GCAGGTAAGC AGTCTAACCA 
460151 GATGACAGGT GAGCAATCTG AGGTGAAGTG ACTTCCCATT ACATGACTTT 
460201 TGACCCCCAT CGTTGTCTTG GAAATACTTT ATGCTAAGTA TTAGAAAGAA 
460251 GGAAAACCTA CTGGCTGTTG CAAACAATTT AAAATAACTT AAAGTGGAAG 
460301 AAAGAAGAAA AAAGCAGAGT GAAATTTTAT AGGGTGGGGA AAACCTGGAG 
460351 GAAACTTTCC TTTGAAAGCC AGGAGTCCAA ACTATAGATA GGATAGTACA 
460401 GAAATTCAGC AATCCCCAAG TGTTCTAGCA GAATTTAAAA CAGAGGATGA 
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460451 AACTTGGAGA ACTCTGGGCT AGTTGTGACC TTCAGATATA CTTTAAAAAA 
460501 ATCAACTAAT ATTTTAAAAT CAGTATTTTC ACATAAGATT ATGACTTTTT 
460551 TTTTCTTTTG AAAATTTTGT AAGATCTGGC AGCATTGGGC CTTTATTGCT 
460601 GCATTGTCAG GAACTGGGCT ATGGGTGTTC CTTCCAGTTG AGGGCTGTTT 
460651 GAGGTCTACT CTGTCATCTA CCTGGCCAGT CCCAGCAAAG CTGGATGTAG 
460701 TTAAACATGG AAGGCACAAC TGGACTAGGG GAATTGGATA AGTCCTTGAA 
460751 CAAGTACCTG AGTTAAAACA TCATAAGTTA AAAGGAGGGG ATACTGGTGT 
460801 GAGTTCTGTT TCTCAAAAAT GGGTGTGGAG TTCTAAAATT AATGTTAATT 
460851 TGGGTTCCAT TTTTTCCAAT TTCCCCACTC TCTGCCATAT TGTATTCTTT 
460901 GACTTAGCAT TAGACTTAGA TAAGTTAGTT AATATTCCTG TAACTTCGGT 
460951 GCTTCATACA CAAAATGGGA ACAATTATAA CTACCTCCTA AGATTTTAGT 
461001 AAGACTGAAT AAGACAATGG ATAACAACGA CACAGCATAA TGCCTGAAGC 
461051 ATCATAAGCA CTTAAATGTT GGTTGCTATT ATGATTGGTA TTCTTTAAAA 
461101 AGTGCTATTA CTTGCCATTA TTGACTTTTG AAACTGGAGG GTTCAAACAA 
461151 ACAAGATGTT GAGAGGCTCA GGC AACCAGT ATGGTTGGAA ATAATAAGAA 
461201 AGTTAGTTAC CTACCGTGAT ACTACACTAA CCCTTGAAAA AACGTGTGGC 
461251 GTTTAAAGAA ATTTCTGCTA AATTTCTGTT TGGCTTCAGC AAAACCTCGT 
461301 GTTCCACTTT CAGAGCATGA AAGAGGCAGT TAAGGCAAGA AAAGCGCATT 
• 461351 TAGAAGTTTA AAGAATGCAA GTATCCTATG GGTTGAGAAA TGAGATTAAT 
461401 CCTTCCTTGA CTAGTGCTCT TTTTACTGGC AGGATAAATT CTTTAGGAAA 
461451 AAGAGTTATG CTGGCTGTGG AAGTAAATAG TGACAGGTGT TCTTTTGTTT 
461501 ATTTTCTATG TATTTTTCAA CCCCAGGTCC ATTTTATTCT CAAATGTAAC 
461551 TTCTTTTCTT AGTCAACCTG CTTTTTGGTA AAATTGTTCT TAAAGCATTC 
461601 AGCTTACATA ATTAAATATG GTTGTGTGGT TACCTTTTAT ATCCGAATGT 
461651 AAGGACCTCC ATATAATCTA ATTATTTTTT CCATTAAAAA AAGATATTTC 
461701 ACAGACACAA GCTATAGGTA AGGATTCATT ACAAAGGAAT CTCATTGTTT 
461751 TAAATGAGCA ATTCCATTAT TTTTGATTCT GCCGAGAGTG GGACCTAATT 
461801 CTGTCAAAGA GAGAAGGTTG TTAAGATTTG GTGGTCAGAA ACACATGACT 
461851 TCCAATCAAA TTTAGCAGTC CTAACTATCG AATTAGGTGT TGTAAGTTTA 
461901 TGTGGGCTTT CCTAAGGTGA ATTGACTACA GGGGGCATAG TTCTATCTAA 
461951 GGAATAATTA GTTATGTTGA GATTCAATGT TATTTGCATA TTTGTGTCTT 
462001 TGAAATTCCT CTGAAAATGA GTAATTTTTA GCAATTATTC TTGGTGTTTG 
462051 AGTCCATTGA GCTAGTCAAA AATTTATAAT TGCAAAAACT TTGGCCAAAT 
462101 TATCAGATAG AGCTAAGATT CGGTTTGGGG TCTTAGGAAT TCCTATTATC 
462151 TCCTTGAACT TTGGGATGTA TCAGTTATTA GAATTTAGAC AGTCAACGAG 
462201 CTTGTTGAAA TTTTTTTTGG AACAAGATCA GTGGAATGAA TGGA^ACAAA 
462251 GAAGGGCATA TATTTGTAGA GGAGGAAAAG GAGTGATTTG CAGAAGATAA - 
462301 AAAAGGAAGA AGATAAAGGA GAGGACGGAA GGGGGAAATA AAATTCACAT 
462351 AAATTGCTTA TCACTGATAT TATAGAAGTA AACTATTTTA TAACAGTATC 
462401 TGCAATTTAT TGTTCACATA GCCTTCAAAG AAAGAAAGAA ATGAAGCAGT 
462451 AAATCACCTT GCCTCAAATT ATAATTTTCC TTACTTGTGC GATATTTATT 
462501 GAGCGATTTT ATGCACAGTG TTGTGTTTAG TGGTGAGGAT ATGTAGCACA 
462551 TAGGACAAAA CTTCCCGTTA TACTGGTATA GTATGT 
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Figure 2a. MEX 4 predicted exon sequence. 



ATGAACCAAACT0K2CAGCGTG 

1 + + + — + + + 60 

MNQTASVSHHIKCQPSKTIK 
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Figure 2c. Full length insert sequence and translation of the foetal brain cDNA 
clone identified by screening with MEX4. The MEX4 sequence is in bold and the 
start codon is underlined. 



^ CTCCGGAGTGAGGAAGCAGCAGAAACAGAAGCAGCAGAAGCAACAGCAGTAGCAGCGGCA 

GCAGCAACAGCAGCAGCCCCTACTGAAGTCCAATAGAGGAGACTTGATCTCTAGTTCATT 
61 + + . • _ 



CTGGAACTCCGCCTGGGATTGTGCACTGTCCAGGGTCCTGAAACATGAACCAAACT6CCA ^ 

121 + + _ + + + + 180 

*NMNQTAS 



GCGTGTCCCATCACATCAAGTGTCAACCCTCAAAAACAATCAAGGAACTGGGAAGTAACA 

+ + + __ + + + 

VSHHIKCQPSKTIKELGSNS 
GCCCTCCACAGAGAAACTGGAAGGGAATTGCTATTGCTCTGCTGGTGATTTTAGTTGTAT 
P P Q R N W K G I A I A L L V I L V 7"c 

GCTCACTCATCACTATGTCAGTCATCCTCTTAACCCCAGATGAACTCACAAATTCGTCAG 
+ + + -f + + 

SLITMSVILLTPDELTNSSE 



AAACCAGATTGTCTTTGGAAGACCTCTTTAGGAAAGACTTTGTGCTTCACGATCCAGAGG 
361 -~ + + + „. + + + 42Q 

TRLSLEDLFRKDFVLHDPEA 

CTCGGTGGATCAATGATACAGATGTGGTGTATAAAAGCGAGAATGGACATGTCATTAAAC 
421 --_--- + + - + . u + + + 480 

RWINDTDVVYKSENGHVIKL 

TGAATATAGAAACAAATGCTACCACATTATTATTGGAAAACACAACTTTTGTAACCTTCA 
481 + +_.:_+ + + + 54Q 

NIETNATTIiLLENTTFVTFK 



AAGCATCAAGACATTCAGTTTCACCAGATTTAAAATATGTCCTTCTGGCATATGATGTCA 
541 + + + + + + 60Q 

ASRHSVSPDLKYVLLAYDVK 



AACAGATTTTTCATTATTCGTATACTGCTTCATATGTGATTTACAACATACACACTAGGG 
QIFHY SYTASY^IYNIHTRE 

AAGTTTGGGAGTTAAATCCTCCAGAAGTAGAGGACTCCGTCTTGCAGTACGCGGCCTGGG 
VWELNPPEVEDSVLQYAAWG 
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GTGTCCAAGGGCAGCAGCTGATTTATATTTTTGAAAATAATATCTACTATCAACCTGATA 
721 + + + + + + 780 

VQGQQLIYIFENNIYYQPDI 

TAAAGAGCAGTTCATTGCGACTGACATCTTCTGGAAAAGAAGAAATAATTTTTAATGGGA 
781 + + + + + + 840 

KSSSLRLTSSGKEE IIFNGI 

TTGCTGACTGGTTATATGAAGAGGAACTCCTGCATTCTCACATCGCCCACTGGTGGTCAC 

841 + + + + + + 900 

A D W L % Y EE E LLH S H I AHWW-S P 

CAGATGGAGAAAGACTTGCCTTCCTGATGATAAATGACTCTTTGGTACCCACCATGGTTA 
901 + + + + + + 96Q 

DGERLAFLMINDSLVPTMVI 

TCCCTCGGTTTACTGGAGCGTTGTATCCCAAAGGAAAGCAGTATCCGTATCCTAAGGCAG 
961 + + + + + + 1020 

PRFTGA LYPKGKQYPYPKAG 

GTCAAGTGAACCCAACAATAAAATTATATGTTGTAAACCTGTATGGACCAACTCACACTT 
1021 + + + 4- + + 1080 

QVNPTIKLYVVNLYGPTHTL 

TGGAGCTCATGCCACCTGACAGCTTTAAATCAAGAGAATACTATATCACTATGGTTAAAT 
1081 — + + + + . — + + H40 

ELMPPDS FKSREYYITMVKW 

GGGTAAGCAATACCAAGACTGTGGTAAGATGGTTAAACCGACCTCAGAACATCTCCATCC 
1141 + + + + + + 1200 

VSNTKTVVRWLNRPQNISIL 

TCACAGTCTGTGAGACCACTACAGGTGCTTGTAGTAAAAAATATGAGATGACATCAGATA 

1201 + + + + + + 1260 

TVCETTTGACSKKY EMT SDT 

CGTGGCTCTCTCAGCAGAATGAGGAGCCCGTGTTTTCTAGA 

1261 + + + +- 1301 

WLSQQNE E-PVFSR 



WO 02/086113^ — PCT/GB02/01887 

157/219 



Figure 2d. Full length insert sequence of the 3' RACE clone 

1 TGAGATGACA TCAGATACGT GGCTCTCTCA GCAGAATGAG GAGCCCGTGT 

51 TTTCTAGAGA CGGCAGCAAA TTCTTTATGA CAGTGCCTGT TAAGCAAGGG 

101 GGACGTGGAG AATTTCACCA CATAGCTATG TTCCTCATCC AGAGTAAAAG 

151 TGAGCAAATT ACCGTGCGGC ATCTGACATC AGGAAACTGG GAAGTGATAA 

201 AGATCTTGGC ATACGATGAA ACTACTCAAA AAATTTACTT TCTGAGCACT 

251 GAATCOTCTC CCAGAGGAAG GCAGCTGTAC AGTGCTTCTA CTGAAGGATT 

301 ATTGAATCGC CAATGCATTT CATGTAATTT CATGAAAGAA CAATGTACAT 

351 ATTTTGATGC CAGTTTTAGT CCCATGAATC AACATTTCTT ATTATTCTGT 

401 GAAGGTCCAA GGGTCCCAGT GGTCAGCCTA CATAGTACGG ACAACCCAGC 

451 AAAATATTTT ATATTGGAAA GCAATTCTAT GCTGAAGGAA GCTATCCTGA 

501 AGAAGAAGAT AGGAAAGCCA GAAATTAAAA TCCTTCATAT TGACGACTAT 

551 GAACTTCCTT TACAGTTGTC CCTTCCCAAA GATTTTATGG ACCGAAACCA 

601 GTATGCTCTT CTGTTAATAA TGGATGAAGA ACCAGGAGGC CAGCTGGTTA 

651 CAGATAAGTT CCATATTGAC TGGGATTCCG TACTCATTGA CATGGATAAT 

701 GTCATTGTAG CAAGATTTGA TGGCAGAGGA AGTGGATTCC AGGGTCTGAA 

751 AATTTTGCAG GAGATTCATC GAAGATTAGG TTCAGTAGAA GTAAAGGACC 

801 AAATAACAGC TGTGAAATTT TTGCTGAAAC TGCCTTACAT TGACTCCAAA 

851 AGATTAAGCA TTTTTGGAAA GGGTTATGGT GGCTATATTG CATCAATGAT 

901 CTTAAAATCA GATGAAAAGC TTTTTAAATG TGGATCCGTG GTTGCACCTA 

951 TCACAGACTT GAAATTGTAT GCCTCAGCTT TCTCTGAAAG ATACCTTGGG 

1001 ATGCCATCTA AGGAAGAAAG CACTTACCAG GCAGCCAGTG TGCTACATAA 

1051 TGTTCATGGC TTGAAAGAAG AAAATATATT AATAATTCAT GGAACTGCTG 

1101 ACACAAAAGT TCATTTCCAA CACTCAGCAG AATTAATCAA GCACCTAATA 

1151 AAAGCTGGAG TGAATTATAC TATGCAGGTC TACCCAGATG AAGGTCATAA 

1201 CGTATCTGAG AAGAGCAAGT ATCATCTCTA CAGCACAATC CTCAAATTCT 

1251 TCAGTGATTG TTTGAAGGAA GAAATATCTG TGCTACCACA GGAACCAGAA 

1301 GAAGATGAAT AATGGACGGT "ATTTATACAG AACTGAAGGG AATATTGAGG 

1351 CTCAATGAAA CCTGACAAAG AGACTGTAAT ATTGTAGTTG CTCCAGAATG 

1401 TCAAGGGCAG CTTACGGAGA TGTCACTGGA GCAGCACGCT CAGAGACAGT 

1451 GAACTAGCAT TTGAATACAC AAGTCCAAGT CTACTGTGTT GCTAGGGGTG 

1501 CAGAACCCGT TTCTTTGTAT GAGAGAGGTC AAAGGGTTGG TTTCCTGGGA 

1551 GAAATTAGTT TTGCATTAAA GTAGGAGTAG TGCATGTTTT CTTCTGTTAT 

1601 CCCCCTGTTT GTTCTGTAAC TAGTTGCTCT CATTTTAATT TCACTGGCCA 

1651 CCATCATCTT TGCATATAAT GCACAATCTA TCATCTGTCC TACAGTCCCT 

1701 GATCTTTCAT GGCTGAGCTG CAATCTAACA CTTTACTGTA CCTTTATAAT 

1751 AAGTGCAATT CTTTCATTGT CTATTATTAT GCTTAAGAAA ATATTCAGTT 

1801 AATAAAAAAC AGAGTATTTT ATGTAATTTC TGTTTTTAAA AAGACATTAT 

1851 TAAATGGGTC AAAGGACATA TAGAAATGTG GATTTCAGCA CCTTCCAAAG 

1901 TTCAGCCAGT TATCAGTAGA TACAATATCT TTAAATGAAC ACACGAGTGT 

1951 ATGTCTCACA ATATATATAC ACAAGTGTGC ATATACAGTT AATGAAAC T A 

2001 TCTTTAAATG TTATTCATGC TATAAAGAGT AAACGTTTGA TGAATTAGAA 

2051 GAGATGCTCT TTTCCAAGCT ATAATGGATG CTTTGTTTAA TGAGCCAAAT 

2101 ATGATGAAAC ATTTTTTCCA ATTCAAATTC TAGCXATTGC TTTCCTATAA 

2151 ATGTTTGGGT TGTGTTTGGT ATTGTTTTTA GTGGTTAATA GTTTTCCAGT 

2201 TGCATTTAAT TTTTTGAATA TGATACCTTG TCACATGTAA ATTAGATACT 

2251 TAAATATTAA ATTATAGTTT CTGATAAAGA AATTTTGTTA ACAATGCAAA 

2252 AAAAAAAAA AAAAAAAA 
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Figure 4. Full length sequence of the DPP10 mRNA generated from an overlap of the 
MEX4FB-1 and 3' RACE clones. The MEX4 sequence is in bold and the initiation and 
termination codons are underlined 

CTCCGGAGTGAGGAAGCAGCAGAAACAGAAGCAGCAGAAGCAACAGCAGTAGCAGCGGCA 
1 + + + + f + 60 

GCAGCAACAGCAGC^GCCCCTACTGAAGTCCAATAGAGGAGACTTGATCTCTAGTTCATT 
61 + + + + + + 120 



CTGGAACTCCGCCTGGGATTGTGCACTGTCCAGGGTCCTGAAACATGAACCAAACTGCCA 

+; . + + + _ — + + 

*NMNQ.TAS 

GCGTGTCCCATCACATCAAGTGTCAACCCTC^^^ 

VSHHIK CQP SKTIKELG~ s"n~~S 
GCCCTCCACAGAGAAACTGGAAGGGAATTGCTATTGCTCTGCTGGTGATTTTAGTTGTAT 

PPQRNWKGIAIALLVILVVC 
GCTCACTCATCACTATGTCAGTCATCCTCTTAACCCCAGATGAACTCACAAATTCGTCAG 

SLITMSVILLT PDEL.TNS S E 
AAACCAGATTGTCTTTGGAAGACCTCTTTAGGAAAGACTTTGTGCTTCACGATCCAGAGG 

TRLSLEDLFRKDFVLHDPEA 

CTCGGTGGATCAATGATACAGATGTGGTGTATAAAAGCGAGAATGGACATGTCATTAAAC 
+ + — + + + + 

RWINDTDVVYK S. ENGHVIKL 

TGAATATAGAAACAAATGCTACCACATTATTATTGGAAAACACAACTTTTGTAACCTTCA 
+ + + , + + 

NIETNATTLLLENTTFVTFK 



AAGCATCAAGACATTCAGTTTCACCAGATTTAAAATATGTCCTTCTGGCATATGATGTCA 
541 ~- + + + + _ + + 600 

ASRHSVSPDLKYVLLAYDVK 



AACAGATTTTTCATTATTCGTATACTGCTTCATATGTGATTTACAACATACACACTAGGG 
+ + + + + + 

QIFHYSYTASYVI Y N I H T R E 
AAGTTTGGGAGTTAAATCCTCCAGAAGTAGAGGACTCCGTCTTGCAGTACGCGGCCTGGG 
VWELNP PEVED SVLQ YAAWG 



GTGTCCAAGGGCAGCAGCTGATTTATATTTTTGAAAATAATATCTACTATCAACCTGATA 

721 + — + + + + 780 

VQGQQLIYI FENNIYYQPDI 

TAAAGAGCAGTTCATTGCGACTGACATCTTCTGGAAAAGAAGAAATAATTTTTAATGGGA 
781 + + + + + + 840 

KS SSLRLTSSGKEEIIFNGI 

TTGCTGACTGGTTATATGAAGAGGAACTCCTGCATTCTCACATCGCCCACTGGTGGTCAC 
841 + + +„ + +— + 900 

ADWLYEEELLHSHIAHWWSP 
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CAGATGGAGAAAGACTTGCCTTCCTGATGATAAATGACTCTTTGGTACCCACC ATGGTTA 

**""+ H + + + j. ggn 

DGERLAFLMINDSLVPTMVI 

TCCCTCGGTTTACTGGAGCGTTGTATCCCAAAGGAAAGCAGTATCCGTATCCTAAGGCAG 
961 h ^ _j i . 

PRFTGALYPKGKQ Y~~P~~Y~ p"iT~A~~G 

1021 Q,] ^ GTGAACCCAACAATAAA ^^ 

QVNPT I KLYVVNLY ~g""p~ + T~"h"~t"l 108 ° 

.„., tgga gctcatgccacctgacagctttaaatcaagagaatactatatcactatggttaaat 

1081 — — — h — — +_ y. j ^_ ^ 1140 

elmppdsfksreyy~i~t~ m~"v"k~~w 

1141 GGGTAAGCAATACCAA ^^ 

VSNTKTVVR W~ ~L~ V V"p" "q" V + l"<TT~L 

1201 ^ CAGAG ^ G * GA ^ 

TVCETTTGACSK K + 777V1"7"t 1260 



CGTGGCTCTCTCAGCAGAATGAGGAGCCCGTGTTTTCTAGAGACGGCAGCAAATTCTTTA 
+ + + + + + 

WLSQQNEEPVFSRDGSKFFM 



TGACAGTGCCTGTTAAGCAAGGGGGACGTGGAGAATTTCACCACATAGCTATGTTCCTCA 

— — — — _ _ — _ ^_ < ^_ . ^ _ 

tvpvkqggrgefhh"ia" m~"f"iT~i 
noi tccagag taaaagtgagcaaattaccgtgcggcatctgacatcaggaaactggg 

1381 + + + + + + 144Q 

QSKSEQITVRHLTSGNWEVI 



TAAAGATCTTGGCATACGATGAAACTACTCAAAAAATTTACTTTCTGAGCACTGAATCTT 
+ + + + + + 

KILAYDETTQKIYFLSTESS 



1501 G ^ G ^ GAGGA ^ GGCAGCTGTA ^ GTGCCTC 

prgrqlysast" 7^"l""l""n"* + r""q""c""i 1560 
tttcatgtaatttcatgaaagaacaatgtacat 

1561 + . + + + 162Q 

SCNFMKEQCTYFDASFSPMN 

ATCAACATTTCTTATTATTCTGTGAAGGTCCAAGGGTCCCAGTGGTCAGCCTACATAGTA 
1&21 + + + + . + + 1680 

QHFLLFCEGPRVPVVSIiHST 

1£D1 CGGACAACCCAGCAAAATATTTTATATTGGAAAGCAATTCTATGCTGAAGGAAGCTATCC 
lool 1 — — — 1 |_ w ^_ _ ^ 1740 

dnpakyfilesnsmlk" e~~a~~i~~l 

1741 * GAAGAAGAAGATAGGAAA ^ 

~ ~ -- - ~~ + ~ ~ + + + + 1800 ' 

KKKIGKPE1KILHIDDYELP 

CTTTAGAGTTGTCCCTTCCCAAAGATTTTATGK5ACCGAAACCAGTATGCTCTTCTGTTAA 
±ouj_ }- . f. . . j ^ ^ I860 

LQLSLP KDFMD RNQyV L~~l"l"i 
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TAATGGATGAAGAACCAGGAGGCCAGCTGGTTACAGATAAGTTCCATATTGACTGGGATT 
1861 + + + + + + 192 o 

MDEEPGGQLVTDKFHIDWDS 

1 qoi CCGTACTCATTGACATGGATAATGTCATTGTAGCAAGATTTGATGGCAGAGGAAGTGGAT 
1921 + + + + + + 1 og 0 

VLIDMDNVIVARFDGRGSGF 

1QQ1 TCCAGGGTCTGAAAATTTTGCAGGAGATTCATCGAAGATTAGGTTCAGTAGAAGTAAAGG 

1 y o 1 — — h — — -| , j m +— — -i 2040 

QGLKILQEIHRRLGSV E~~v"lT~D 

ACCAAATAACAGCTGTGAAATTTTTGCTGAAACTGC^ 
2041 + + + + + _„ + 2100 

QITAVKFLLKLPYIDSKRLS 

GCATTTTTGGAAAGGGTTATGGTGGCTATATTGCATCAATGATCTTAAAATCAGATGAAA 
2101 + + + + + + 2160 

IFGKGYGGYIASMILKSDEK 



AGCTTTTTAAATGTGGATCCGTGGTTGCACCTATCACAGACTTGAAATTGTATGCCTCAG 
+ + + + + 

LFKCGSVVAPITDLKLYAS A 



CTTTCTCTGAAAGATACCTTGGGATGCCATCTAAGGAAGAAAGCACTTACCAGGCAGCCA 
2221 + + + + + + 2280 

FSERYLGMP. S KEESTYQAAS 

OOQ1 GTGTGCTACATAATGTTC^TGGCTTGAAAGAAGAAAATATATTAATAATTCATGGAACTG 
2281 h- — • 1 + , ^ + 2340 

V L ■ H N V H G L K E B N I L I I H "g~T~A 

CTGACACAAAAGTTCATTTCCAACACTCAGCAGAATTAATCAAGCACCTAATAAAAGCTG 
2341 + + + + + + 2400 

DTKVHFQHSAELIKHLIKAG 

2401 ^^^^^^ 

VNYTMQVYPD ^7^777^771 

AGTA ^ TCTCTACAGCA ^^ 
2461 + + + + + + 2520 

YHLYSTILKFFSDCLK- EE IS 

CTGTGCTACCACAGGAACCAGAAGAAGATGAATAATGGACCGTATTTATACAGAACTGAA 
2521 + + + + , 9J ron 

VI/PQEPEEDE* 



GGGAATATTGAGGCTCAATGAAACCTGACAAAGAGACTGTAATATTGTAGTTGCTCCAGA 
+ + + + + + 

ATGTCAAGGGCAGCTTACGGAGATGTCACTGGAGCAGCACGCTCAGAGACAGTGAACTAG 



2641 — , -r . , ,. 

+ + + + 2700 

_ ni CATTTGAATACACAAGTCCAAGTCTACTGTGTTGCTAGGGGTGCAGAACCCGTTTCTTTG 
2701 + + + _ + + + 2760 

TATGAGAGAGGTCAAAGGGTTGGTTTCCTGGGAGAAATTAGTTTTGCATTAAAGTAGGAG 

2761 -f 4- 4. i . 

+ + + + 2Q20 

TAGTG^ATGTTTTCTTCTC 

2821 -l ■ — -j j i , _ _ _ 

+ +- + + 2880 
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ATTTCACTGGCCACCATCATCTTTGCATATAATGCACAATCTATCATCTGTCCTACAGTC 
2881 + + + + + + 2940 

CCTGATCTTTGATGGCTGAGCTGCAATCTAACACTTTACTGTACCTTTATAATAAGTGCA 
2941 + + + + + + 3000 

ATTCTTTCATTGTCTATTATTGTGCTTAAGAAAATATTCAGTTAATAAAAAACAGAGTAT 
3001 + + + + _ _„„ + + 3060 

TTTATGTAATTTCTGTTTTT^^ . 
3061 + +" 4- + + + 3120 

GTGGATTTCAGCACCTTCCAAAGTTCAGCCAGTTATCAGTAGATACAATATCTTTAAATG 
3121 + + + + + + 3180 

AACACACGAGTGTATGTCTCACAATATATATACACAAGTGTGCATATACAGTTAATGAAA 
3181 + + + + + . + 3240 

CTATCTTTAAATGTTATTCATGCTATAAAGAGTAAACGTTTGATGAATTAGAAGAGATG 
3241 + + + + + + 3300 

TCTTTTCCAAGCTATAATGGATGCTTTGTTTAATGAGCCAAATATGATGAAACATTTTTT 
3301 + + + + + + 3360 

CCAATTCAAATTCTAGCTATTGCTTTCCTATAAATGTTTGGGTTGTGTTT^^ 
3361 + + + + + + 3420 

TTAGTGGTTAATAGTTTTCCAGTTGCATTTAATTTTTTGAATATGATACCTTGTCACATG 
3421 + + + + + + 3480 

TAAATTAGATACTTAAATATTAAATTATAGTTTCTGATAAAGAAATTTTGTTAACAATGC 
3481 + + + + + + 3540 

AAAAAAAAAA 
3541 3550 
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Figure 5. Mouse DPP10 coding sequence only 



ATGAACCAAACAGCC AGCGTGTC CC ATCAC ATC AAGTGTC AGCCC TCC AAGAC CATCAAG 
1 + + + + +- + 60 

TACTTGGTTTGTCGGTCGCACAGGGTAGTGTAGTTCACAGTCGGGAGGTTCTGGTAGTTC 

MNQTASVSHHIKCQPSKTIK 

GAACTAGGAAGTAACAGCCCTCCACAAAGAAACTGGAAGGGAATCGCCATTGCCCTGCTG 
61 + + + + + + 120 

CTTGATCCTTCATTGTCGGGAGGTGTTTCTTTGACCTTCCCTTAGCGGTAACGGGACGAC 

ELGSNSPPQRNWKGIAIAIiL 

GTGATCTTGGTGGTATGCTCACTCATCACAATGTCTGTCATCCTGTTAACACCAGATGAA 

121 + + + + + + 180 

CACTAGAACCACCATACGAGTGAGTAGTGTTACAGACAGTAGGACAATTGTGGTCTACTT 

VILVVCS- LITMSVILLTPDE 

CTAACAAATTCTTCAGAAACAAGACTGTCACTAGAGGAGCTTTTGGGGAAAGGATTTGGA 

181 + + + + + + 240 

GATTGTTTAAGAAGTCTTTGTTCTGACAGTGATCTCCTCGAAAACCCCTTTCCTAAACCT 

LTNSS ETRLSLEELL GKGFG 

CTTCATAATCCAGAAGCTCGGAGGATCAATGATACAGTCGTTGTATATAAAACCAACAAT 

241 + + + + + + 300 

GAAGTATTAGGTCTTCGAGCCTCCTAGTTACTATGTCAGCAACATATATTTTGGTTGTTA 

LHNPEARRINDTVVVYKTNN 

GGACACGTCATGAAACTGAATACAGAATCAAATGCTTCCACATTGTTATTGGACAACTCA 

301 + + + + + + 360 

CCTGTGCAGTACTTTGACTTATGTCTTAGTTTACGAAGGTGTAACAATAACCTGTTGAGT 

GHVMKLNTESNA STLLLDNS 

ACTTTTGTAACGTTTAAAGCCTCCAGACACTCACTTTCCCCAGATTTAAAGTATGTTCTC 

361 + + + + + + 420 

TGAAAACATTGCAAATTTCGGAGGTCTGTGAGTGAAAGGGGTCTAAATTTCATACAAGAG 

TFVTFKASRHSLSPDLKYVL 

CTGGCATACGACGTCAAGCAGATTTTTCATTACTCATTTACCGCTTCATATTTGATTTAC 

421 + + + + + + 480 

GACCGTATGCTGCAGTTCGTCTAAAAAGTAATGAGTAAATGGCGAAGTATAAACTAAATG 

LAYDVKQIFHYS FTASYLIY 

AACATACACACTGGGGAAGTGTGGGAGTTGAATCCCCCAGAAGTTGAGGACTCAGTGTTG 

481 + 4- + + + + 540 

TTGTATGTGTGACCCCTTCACACCCTCAACTTAGGGGGTCTTCAACTCCTGAGTCACAAC 

NIHTGEVWELNPPEVEDSVL 
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CAGTATGCGGCCTGGGGTGTGCAAGGACAGCAGCTGATTTATATTTTTGAAAACAATATC 
541 +- + + + + + 60 o 

GTCATACGCCGGACCCCACACGTTCCTGTCGTCGACTAAATATAAAAACTTTTGTTATAG 
QYAAWGVQG Q Q L I Y I F* E N N I - 

TACTATCAACCTGATATCAAGAGCAGTTCTTTACGACTAACATCTTCAGGAAAGGAAGGC 
601 + + + + + + 560 

ATGATAGTTGGACTATAGTTCTCGTCAAGAAATGCTGATTGTAGAAGTCCTTTCCTTCCG 
YYQPDIKSS SLRLTSSGKEG - 

ATTATTTTTAATGGGATTGCTGACTGGTTATATGAAGAAGAACTTCTGCATTCTCATATC 
661 + + + + +- + 720 

TAATAAAAATTACCCTAACGACTGACCAATATACTTCTTCTTGAAGACGTAAGAGTATAG 
IIFNGlADW LYEEEIiLHSHI 

GCCCACTGGTGGTCCCCCGATGGAGAGAGACTTGCCTTCCTGATGATAAATGACTCGTTG 
721 + -+ + + + + 780 

CGGGTGACCACCAGGGGGCTACCTCTCTCTGAACGGAAGGACTACTATTTACTGAGCAAC 

AHWWSPDG,ERLAFLMIHDSL 

GTGCCTAACATGATCATACCTCGGTTTACTGGAGCACTGTACCCCAAAGCAAAGCAGTAC 
781 +- + + + + + 840 

CACGGATTGTACTAGTATGGAGCCAAATGACCTCGTGACATGGGGTTTCGTTTCGTCATG 
VPNMII PRFTGALYPKAKQY - 

CCATACCCCAAGGCAGGTCAAGCGAACCCATCAGTGAAGTTATATGTTGTAAACCTGTAT 
841 + + + _ + + + 9Q0 

GGTATGGGGTTCCGTCCAGTTCGCTTGGGTAGTCACTTCAATATACAACATTTGGACATA 
PYPKAGQANPSVKLYVVNLY - 
GGACCAACTCATACTTTGGAGCTGATGCCACCCGACATTTTTAAATCAAGAGAATATTAC 

soi — — i-+ + — __„_ + + + 960 

CCTGGTTGAGTATGAAACCTCGACTACGGTGGGCTGTAAAAATTTAGTTCTCTTATAATG 
G PT HTIi EIiM P P D I FK S RE Y Y. - 

ATCACCATGGTTAAATGGGTGAGCAACACGAGGACAGTGGTCAGGTGGTTGAATAGGCCT 
961 + + + + + + 1020 

TAGTGGTACCAATTTACCCACTCGTTGTGCTCCTGTCACCAGTCCACCAACTTATCCGGA 

IT MVKWVSNTRTVVRWLNR P 

CAGAACATCTCCATCCTCACACTCTGTGAATCTACCACCGGGGCATGCAGCAGGAAATAT 
1021 + + + + __„„„_ + + 1Q80 

GTCTTGTAGAGGTAGGAGTGTGAGACACTTAGATGGTGGCCCCGTACGTCGTCCTTTATA 

QJSTISILTLCESTTGACSRKY 

GAGATGACGTCCGACACCTGGCTCTCTAAACAGAACGAGGAGCCAGTGTTTTCCAGAGAT 
1081 + + + + + + 1140 
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CTCTACTGCAGGCTGTGGACCGAGAGATTTGTC1T 

EMTSDTWLSKQNEEPVFSRD 

GGAAGCAAGTTCTYCATGACTGTTCTTGTTAAACAAGGTGGAAGAGGAGAATTCCACCAC 

1141 + + + + + + 1200 

CCTTCGTTCAAGARGTACTGACAAGAACAATTTGTTCCACCTTCTCCTCTTAAGGTGGTG 

GSKF7MTVLVKQGGRGEFHH 

ATAGCTATGTTCCTGGTCCAGAGTAAAAGTGAGCAAATTACGGTGCGGCATCTGACATCA 

1201 + + + + + + 1260 

TATCGATACAAGGACCAGGTCTCATTTTCACTCGTTTAATGCCACGCCGTAGACTGTAGT 

IAM-FLVQS KSEQ ITVRH LTS 

GGAAACTGGGAAGTGATAAGGATCTTGGCCTATGATGAAACAACTCAAAAAATTTACTTT 

1261 + + + + + + 1320 

CCTTTGACCCTTCACTATTCCTAGAACCGGATACTACTTTC 

GNWEVIRILAYDETTQKIYF 

CTGAGCACAGAATCTTCTCCCCAAGGGAGGCAACTATACAGTGCTTCTACTGAGGGACTA 

1321 + + + + + + 1380 

GACTCGTGTCTTAGAAGAGGGGTTCCCTCCGTTGATATGTCACGAAGATGACTCCCTGAT 

LSTESSPQGRQLYSASTEGL 

TTGAATCGTGATTGCATCTCATGCAACTTTATGAAAGAAGATTGCACGTATTTTGATGCC 

1381 + + + + + + 1440 

AACTTAGCACTAACGTAGAGTACGTTGAAATACTTTCTTCTAACGTGCATAAAACTACGG 

LNRDC iscnfmkedctyfda 

AGCTTTAGCCCCATGAATCAGCATTTCTTGTTATTCTGTGAAGGTCCAAAGGTCCCAGTG 

1441 + + + + + + 1500 

TCGAAATCGGGGTACTTAGTCGTAAAGAACAATAAGACACTTCCAGGTTTCCAGGGTCAC 

SF S PMN'QHFLLF C EGPKVPV 

GTCAGCCTTCACATCACAGAGAACCCATCAAGGTATTTCCTCTTGGAAAACAATTCTGTG 

1501 + + — -+ -f— + + 1560 

CAGTCGGAAGTGTAGTGTCTGTTGGGTAGTTCCATAAAGGAGAACCTTTTGTTAAGACAC 

VSLHITDNPSRYFLLENNSV 

ATGAAAGAAACTATTCAGAAGAAGAAGCTCGCAAAGAGAGAGACTAGAATACTTCACATT 

1561 + + + +- + + 1620 

TACTTTCTTTGATAAGTCTTCTTCTTCGAGCGTTTCTCTCTCTGATCTTATGAAGTGTAA 

M^KETI QKKKLAKRETRIIiHl 

GATGACTATGAACTTCCTTTACAGTTGTCCTTTCCCAAAGATTTTATGGAGAAAAACCAG 

1621 + + + + + + 1680 

CTACTGATACTTGAAGGAAATGTCAACAGGAAAGGGTTTCTAAAATACCTCTTTTTGGTC 

DDYEIiPIiQLSFPKDFMEKNQ 
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TATGCTCTTCTATTAATAATGGATGAAGAACCAGGAGGCCAAATGGTGACAGATAAGTTC 
1681 + + + + — + 1740 

ATACGAGAAGATAATTATTACCTACTTCTTGGTCCTCCGGTTTACCACTGTCTATTCAAG 

YALIjLIMDEEPGGQMVTDKF 

CATGTTGACTGGGATTCAGTTCTTATTGACACCGATAATGTCATTGTAGCAAGATTTGAT 
1741 + + + + + 180Q 

GTACAACTGACCCTAAGTCAAGAATAACTGTGGCTATTACAGTAACATCGTTCTAAACTA 
HVDWDS V L I DTDNV I V A R FD 

GGCAGAGGAAGTGGATTCCAGGGCCTGAAAGTTTTGCAGGAGATTCACAGAAGGATAGGC 
1801 + + + + + 186Q 

CCGTCTCCTTCACCTAAGGTCCCGGACTTTCAAAACGTCCTCTAAGTGTCTTCCTATCCG 

GRGSGFQGLKVLQS-IHRRZG - 

TCAGTGGAGGCAAAGGACCAAGTAGCTGCTGTAAAATATTTACTGAAACAGCCATATATT 
1861 + + + + + + 1920 

AGTCACCTCCGTTTCCTGGTTCATCGACGACATTTTATAAATGACTTTGTCGGTATATAA 
SVEAKD QVAAVK YLLKQ PYI 

GACTCCAAAAGATTAAGCATTTTTGGAAAGGGATATGGGGGCTATATCGCATCAATGATC 
1921 + + + + + + 1980 

CTGAGGTTTTCTAATTCGTAAAAACCTTTCCCTATACCCCCGATATAGCGTAGTTACTAG 
D S KRLS I FGKG.YGGYI A SMI 

TTAAAATCAGATGAGAAGTTTTTCAAATGTGGAGCCGTGGTTGCACCCATCTCAGACATG 
1981 + + + + + + 2040 

AATTTTAGTCTACTCTTCAAAAAGTTTACACCTCGGCACCAACGTGGGTAGAGTCTGTAC 

IiKSDEKPFKCGAVVAPISDM 

AAGTTGTATGCCTCAGCTTTCTCTGAACGGTACCTTGGCATGCCATCAAAGGAAGAAAGC 
2041 — + + + + + + 210Q 

TTCAACATACGGAGTCGAAAGAGACTTGCCATGGAACCGTACGGTAGTTTCCTTCTTTCG 

KLYASAFSERYLGMPSKEES 

ACTTACCAGGCATCCAGTGTACTGCATAACATTCATGGTTTAAAAGAAGAAAATTTATTA 
2101 + + + + + + 2160 

TGAATGGTCCGTAGGTCACATG^CGTATTGTAAGTACCAAATTTTCTTCTTTTAAATAAT 

TYQASSVL.HNIHGLKEENLL 

ATAATTCACGGAACTGCTGATACAAAAGTTCATTTCCAGCATTCAGCAGAATTGATCAAG 
2161 + +- + + + 2220 

TATTAAGTGCCTTGACGACTATGTTTTCAAGTAAAGGTCGTAAGTCGTCTTAACTAGTTC 
I I H k G TAD T KVHF QH SA E L I K - 

CATCTGATAAAAGCTGGGGTGAATTACACTCTGCAGGTCTATCCAGATGAAGGATATCAC 
2221 + + + + + — + 2280 
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GTAGACTATTTTCGACCCCACTTAATGTGAGACGTCCAGATAGGTCTACTTCCTATAGTG 

HLIKAG*VNY. TLQVYPDEGYH 

ATTTCAGACAAGAGCAAGCATCATTTTTACAGCAC^TTCTCAGATTCTTCAGTGATTGT 
2281 + + + — + + + 2340 

TAAAGTCTGTTCTCGTTCGTAGTAAAAATGTCGTGTTAAGAGTCTAAGAAGTCACTAACA 

I SDKSKHHFYSTILRFFSDC - 

CTAAAGGAAGAGGTATCTGTGCTGCCACAGGAACCAGAAGAAGATGAATAA 
2341 + + + + +- 2391 

GATTTCCTTCTCCATAGACACGACGGTGTCCTTGGTCTTCTTCTACTTATT 
LKEEVSVLPQEPEEDE* 
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Figure 5a. Mouse DPP10 cDNA sequence. 



CTTTCTATCCCATATTGCCAAGAGCTCTGA6AATGATGAACCAAGTAAATGCTTTATTTA 
+ + + + + + 

GAAAGATAGGGTATAACGGTTCTCGAGACTCTTACTACTTGGTTCATTTACGAAATAAAT 



GGGCTAAGTGAGCTGCGGATCCCTCCAGCCTGACAGCGTTTCAGCCTGCAGAGTAAGACA 

+ + + + + + 

CCCGATTCACTCGACGCCTAGGGAGGTCGGACTGTCGCAAAGTCGGACGTCTCATTCTGT 



GCAGCAGCAACAGAAGCAGCAGAGGCAGCAGCAACAGCAGCAGCCCCTTGCTCAAGTCCA 
121 ~~ + + + + + 180 

CGTCGTCGTTGTCTTCGTCGTCTCCGTCGTCGTTGTCGTCGTCGGGGAACGAGTTCAGGT 



GCAGGGGAGACTTGATCTCTGGTTAGTCTTGGAACTCAGCCTGTGGATCGTGCACTGTCC 
+ + + + + + 

CGTCCCCTCTGAACTAGAGACCAATCAGAACCTTGAGTCGGACACCTAGCACGTGACAGG 



AGGGTCCTGAAATATGAACCAAACAGCCAGCGTGTCCCATCACATCAAGTGTCAGCCCTC 
241 + + + + + + 300 

TCCCAGGACTTTATACTTGGTTTGTCGGTCGCACAGGGTAGTGTAGTTCACAGTCGGGAG 
*NMNQTASVSHH IKCQPS 

CAAGACCATCAAGGAACTAGGAAGTAACAGCCCTCCACAAAGAAACTGGAAGGGAATCGC 
301 + + + + + + 360 

GTTCTGGTAGTTCCTTGATCCTTCATTGTCGGGAGGTGTTTCTTTGACCTTCCCTTAGCG 
KTI KELGSNSP PQRNWKGIA- 

CATTGCCCTGCTGGTGATCTTGGTGGTATGCTCACTCATCACAATGTCTGTCATCCTGTT 
361 + + + + + + 420 

GTAACGGGACGACCACTAGAACCACCATACGAGTGAGTAGTGTTACAGACAGTAGGACAA 

IALLVILVVCSLITMSVI LL- 
AACACCAGATGAACTAACAAATTCTTCAGAAACAAGACTGTCACTAGAGGAGCTTTTGGG 

421 + + ; + + 48Q 

TTGTGGTCTACTTGATTGTTTAAGAAGTCTTTGTTCTGACAGTGATCTCCTCGAAAACCC 

TPDELTNSSETRLSLEELLG- 

GAAAGGATTTGGACTTCATAATCCAGAAGCTCGGAGGATCAATGATACAGTCGTTGTATA 
481 + + + + + + 54Q 

CTTTCCTAAACCTGAAGTATTAGGTCTTCGAGCCTCCTAGTTACTATGTCAGCAACATAT 

KGFGLHNPEARRINDTVVVY 

TAAAACCAACAATGG^CACGTCATGAAACTGAATACAGAATC 
541 + + + + + 600 

ATTTTGGTTGTTACCTGTGCAGTACTTTGACTTATGTCTTAGTTTACGAAGGTGTAACAA 
KTNNGHVMKLNTESNASTIiL- 
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ATTGGACAACTCAACTTTTGTAACGTTTAAAGCCTCCAGACACTCACTTTCCCCAGATTT 
601 + + + + + + 660 

TAACCTGTTGAGTTGAAAACATTGCAAATTTCGGAGGTCTGTGAGTGAAAGGGGTCTAAA 

LDNSTFVTFKASRHSLS.PDL- 

AAAGTATGTTCTCCTGGCATACGACGTCAAGCAGATTTTTCATTACTCATTTACCGCTTC 
661 + + + +- + + 720 

TTTCATACAAGAGGACCGTATGCTGCAGTTCGTCTA7VAAAGTAATGAGTAAATGGCGAAG 

KYVLLAYDVKQIFHYSFTAS - 

ATATTTGATTTACAACATACACACTGGGGAAGTGTGGGAGTTGAATCCCCCAGAAGTTGA 
721 + + + + + + 780 

TATAAACTAAATGTTGTATGTGTGACCCCTTCACACCCTCAACTTAGGGGGTCTTCAACT 

YLIYNIHTG'EVWELNPPEVE- 

GGACTCAGTGTTGCAGTATGCGGCCTGGGGTGTGCAAGGACAGCAGCTGATTTATATTTT 
781 + + + + + + 840 

CCTGAGTCACAACGTCATACGCCGGACCCCACACGTTCCTGTCGTCGACTAAATATAAAA 
D SVLQYAAWGVQGQQL I Y IF 

TGAAAACAATATCTACTATCAACCTGATATCAAGAGCAGTTCTTTACGACTAACATCTTC 
841 + + + + + + 900 

ACTTTTGTTATAGATGATAGTTGGACTATAGTTCTCGTCAAGAAATGCTGATTGTAGAAG 

EWNIYYQPDIKSSSLRLTSS 

AGGAAAGGAAGGCATTATTTTTAATGGGATTGCTGACTGGTTATATGAAGAAGAACTTCT 
901 — + + + + + + 96Q 

TCCTTTCCTTCCGTAATAAAAATTACCCTAACGACTGACCAATATACTTCTTCTTGAAGA 

GKEGI IFNGIADWLYEEELL - 

GCATTCTCATATCGCCCACTGGTGGTCCCCCGATGGAGAGAGACTTGCCTTCCTGATGAT 
961 + + + + + + 1020 

CGTAAGAGTATAGCGGGTGACCACCAGGGGGCTACCTCTCTCTGAACGGAAGGACTACTA 
H S H I A HWW S P DGERLAF L MI - 

AAATGACTCGTTGGTGCCTAACATGATCATACCTCGGTTTACTGGAGCACTGTACCCCAA 
1021 + + + + + + 1080 

TTTACTGAGCAACCACGGATTGTACTAGTATGGAGCCAAATGACCTCGTGACATGGGGTT 

NDSLVPNMIIPRPTGALYPK- 

AGCAAAGCAGTACCCATACCCCAAGGCAGGTCAAGCGAACCCATCAGTGAAGTTATATGT 
1081 + + + + + + 1140 

TCGTTTCGTCATGGGTATGGGGTTCCGTCCAGTTCGCTTGGGTAGTCACTTCAATATACA 

AKQYPYPKAGQANPSVKLYV- 

TGTAAACCTGTATGGACCAACTCATACTTTGGAGCTGATGCCACCCGACATTTTTAAATC 
1141 + + + -+ + + 1200 

ACATTTGGACATACCTGGTTGAGTATGAAACCTCGACTACGGTGGGCTGTAAAAATTTAG 
VNLYGPTHTLELMPPDIFKS- 
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AAGAGAATATTACATCACCATGGTTAAATGGGTGAGCAACACGAGGACAGTGGTCAGGTG 
1201 + + + + + + 126Q 

TTCTCTTATAATGTAGTGGTACCAATTTACCCACTCGTTGTGCTCCTGTCACCAGTCCAC 

R E Y Y I TMVK WVS N TRTVVR W- 

GTTGAATAGGCCTCAGAACATC TC CATCCTC AC ACTCTGTGAATCTACCACCGGGGC ATG 

1261 + + + + + + 1320 

CAACTTATCCGGAGTCTTGTAGAGGTAGGAGTGTGAGACACTTAGATGGTGGCCCCGTAC 

XiNRPQNISILTLCESTT GAC 

CAGCAGGAAATATGAGATGACGTCCGACACCTGGCTCTCTAAACAGAACGAGGAGCCAGT 
1321 + + + + + + 1380 

GTCGTCCTTTATACTCTACTGCAGGCTGTGGACCGAGAGATTTGTCTTGCTCCTCGGTCA 

SRKYEMTSDTWL SKQNEEPV- 

GTTTTCCAGAGATGGAAGCAAGTTCTYCATGACTGTTCTTGTTAAACAAGGTGGAAGAGG 
1381 + + + + + + 1440 

CAAAAGGTCTCTACCTTCGTTCAAGARGTACTGACAAGAACAATTTGTTCCACCTTCTCC 

FSRDGSKF7MTVLVKQGGRG- 

AGAATTCCACCACATAGCTATGTTCCTGGTCCAGAGTAAAAGTGAGCAAATTACGGTGCG 
1441 + + + + _ + + 1500 

TCTTAAGGTGGTGTATCGATACAAGGACCAGGTCTCATTTTCACTCGTTTAATGCCACGC 

EFHHIAMFLVQS KSEQITVR- 

GCATCTGACATCAGGAAACTGGGAAGTGATAAGGATCTTGGCCTATGATGAAACAACTCA 
1501 -+ + + + + + 1560 

CGTAGACTGTAGTCCTTTGACCCTTCACTATTCCTAGAACCGGATACTACTTTGTT.GAGT 

HLTSGNWEVIRILAYDETTQ- 

AAAAATTTACTTTCTGAGCACAGAATCTTCTCCCCAAGGGAGGCAACTATACAGTGCTTC 

1561 -+ + + +— — + + 1620 

TTTTTAAATGAAAGACTCGTGTCTTAGAAGAGGGGTTCCCTCCGTTGATATGTCACGAAG 

KIYFLSTES SPQGRQLYS AS- 

TACTGAGGGACTATTGAATCGTGATTGCATCTCATGCAACTTTATGAAAGAAGATTGCAC 
1621 + + + + + + 1680 

ATGACTCCCTGATAACTTAGCACTAACGTAGAGTACGTTGAAATACTTTCTTCTAACGTG 
TEGLLNRDC I SCNFMKEDCT- 

GTATTTTGATGCCAGCTTTAGCCCCATGAATCAGCATTTCTTGTTATTCTGTGAAGGTCC 
1681 + + + + — + + 1740 

CATAAAACTACGGTCGAAATCGGGGTACTTAGTCGTAAAGAACAATAAGACACTTCCAGG 

YFDASFSPMNQHFLLFCEGP- 

AAAGGTCCCAGTGGTCAGCCTTCACATCACAGACAACCCATCAAGGTATTTCCTCTTGGA 
1741 -+ + + + + + 1800 

TTTCCAGGGTCACCAGTCGGAAGTGTAGTGTCTGTTGGGTAGTTCCATAAAGGAGAACCT 
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KVPVVS LH ITDNP S R Y FLLE - 

AAACAATTCTGTGATGAAAGAAACTATTCAGT^AGAAGAAGCTCGCAAAGAGAGAGACTAG 
1801 + + + + + + I860 

TTTGTTAAGACACTACTTTCTTTGATAAGTCTTCTTCTTCGAGCGTTTCTCTCTCTGATC 

NNSVMKETIQKKKLAKRETR- 

AATACTTCACATTGATGACTATGAACTTCCTTTACAGTTGTCCTTTCCCAAAGATTTTAT 
1861 + + -f : + + + 1920 

TTATGAAGTGTAACTACTGATACTTGAAGGAAATGTCAACAGGAAAGGGTTTCTAAAATA 

ILHIDDYELPLQLSFPKDFM- 

GGAGAAAAACCAGTATGCTCTTCTATTAATAATGGATGIUVGAACCAGGAGGCCAAATGGT 
1921 + + +. + + + 1980 

CCTCTTTTTGGTCATACGAGAAGATAATTATTACCTACTTCTTGGTCCTCCGGTTTACCA 

EKNQYAIiLLI MDEE P GGQMV- 

GACAGATAAGTTCCATGTTGACTGGGATTCAGTTCTTAT^ 

1981 + + + + + + 2040 

CTGTCTATTCAAGGTACAACTGACCCTAAGTCAAGAATAACTGTGGCTATTACAGTAACA 

TDKFHVDWDSVLIDTDNV IV- 

AGCAAGATTTGATGGCAGAGGAAGTGGATTCCAGGGCCTGAAAGTTTTGCAGGAGATTCA 
2041 + + + + + + 2100 

TCGTTCTAAACTACCGTCTCCTTCACCTAAGGTCCCGGACTTTCAAAACGTCCTCTAAGT 

AR, FDGRGSGFQGLKVIiQEIH- 

CAGAAGGATAGGGTCAGTGGAGGCAAAGGACCAAGTAGCTGCTGTAAAATATTTACTGAA 
2101 r -+ + + +- + + 2160 

GTCTTCCTATCCGAGTCACCTCCGTTTCCTGGTTCATCGACGACATTTTATAAATGACTT 

RRI G SV EAKDQVAAVKYIiLK- 

ACAGCCATATATTGACTCCAAAAGATTAAGCATTTTTGGAAAGGGATATGGGGGCTATAT 
2161 h ' — + + h . + + 2220 

TGTCGGTATATAACTGAGGTTTTCTAATTCGTAAAAACCTTTCCCTATACCCCCGATATA 
QPYID S KRLSIFGKGYGGY I- 

CGCATCAATGATCTTAAAATCAGATGAGAAGTTTTTCAAATGTGGAGCCGTGGTTGCACC 
2221 + --+ + + : + + 2280 

GCGTAGTTACTAGAATTTTAGTCTACTCTTCAAAAAGTTTACACCTCGGCACCAACGTGG 

ASMILKSDEKFFKCGAVVAP- 

CATCTCAGACATGAAGTTGTATGCCTCAGCTTTCTCTGAACGGTACCTTGGCATGCCATC 
2281 + + + + + + 2340 

GTAGAGTCTGTACTTGAACATACGGAGTCGAAAGAGAC 

ISDMKLYASAFSERYLGMPS - 

AAAGGAAGAAAGCACTTACCAGGCATCCAGTGTACTGCATAACATTCATGGTTTAAAAGA 
2341 + + + + + + 2400 

TTTCCTTCTTTCGTGAATGGTCCGTAGGTCACATGACGTATTGTAAGTACCAAATTTTCT 
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KEE ST'YQASSVLHNIHGIiKE 

AGAAAATTTATTAATAATTCACGGAACTGCTGATACAAAAGTTCATTTCCAGCATTCAGC 

2401 + + + + + + 2460 

TCTTTTAAATAATTATTAAGTGCCTTGACGACTATGTTTTCAAGTAAAGGTCGTAAGTCG 

ENL LI I H G TADT KVH FQH SA 

AGAATTGATCAAGCATCTGATAAAAGCTGGGGTGAATTACACTCTGCAGGTCTATCCAGA 

2461 + + + + + + 2520 

TCTTAACTAGTTCGTAGACTATTTTCGACC CC ACTTAATGTGAGAC GTCCAGATAGGTCT 

ELIKHLIKAGVNYTLQV YPD- 

TGAAGGATATCACATTTCAGACAAGAGCAAG(^TCATTTTTACAGCACAATTCTCAGATT 

2521 + + + + + + 2580 

ACTTCCTATAGTGTAAAGTCTGTTCTCGTTCGTAGTAAAAATGTCGTGTTAAGAGTCTAA 

EGYHISDKSKHHFYSTILRF - 

CTTCAGTGATTGTCTAAAGGAAGAGGTATCTGTGCTGCCACAGGAACCAGAAGAAGATGA 

2581 + + + + + + 2640 

GAAGTCACTAACAGATTTCCTTCTCCATAGACACGACGGTGTCCTTGGTCTTCTTCTACT 

FSDCLKEEVSVLPQEPEEDE 

ATAATGGACTACACTTATACAGAGCTGAAGGGAATATTGAGGCTCAATGAAACTGAACCA 

2641 + + + + + + 2700 

TATTACCTGATGTGAATATGTCTCGACTTCCCTTATAACTCCGAGTTACTTTGACTTGGT 



AGAGACTGTCTTGTTGTAGACGCTCCAGAATTCTAAGGGCAGCTTGAGGAGATAAACCTG 

2701 + + + + + 2760 

TCTCTGACAGAACAACATCTGCGAGGTCTTAAGATTCCCGTCGAACTCCTCTATTTGGAC 



GAACAGCACACTCAGACAGTGAACTAGACCTTGGAGACAGACATCCACACTACTGTGCTG 
2761 + + + + + + 2820 

CTTGTCGTGTGAGTCTGTCACTTGATCTGGAACCTCTGTCTGTAGGTGTGATGACACGAC 



CCAAGGGTGCAGAAGGTGTTTCATTGTAAAGGGAAGTTGCAAGGGTTGGTTTCCTGGGGG 

2821 + + + + + + 2880 

GGTTCCCACGTCTTCCACAAAGTAACATTTCCCTTCAACGTTCCCAACCAAAGGACCCCC 



AAAAAAAATTAGTTTCACATTAAAGTAGG 

2881 + + 2909 

TTTTTTTTAATCAAAGTGTAATTTCATCC 
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Figure 8. Mouse DPP10 alternate exons and predicted peptides 

>Mouse alternate exon 1b 



1 CCGCTCCCAC GTTTGCCTCT CTTCATCTTC CCTCCGCCCG ACTCC 



>Mouse alternate exon 1c 



1 CAGGCTCTCT TGTCTCATCC ATGGACCCAG CAGGAACTTA GAGCAGCCTA 
51 GTGCGCTCTG GCGTCAGA 



>Mouse Alternate exon 1d 



1 ATGCTCAAGG TGGAGAGCCG TGGGGAAGCG GGCAGAGAG 



>Mouse alternate exon 1e 



1 ATGACAGCCA TGAAGCAGGA GCAGCAACCC ACCCCAGGGG CCAGGGCAAC 
51 CCAGTCGCAG CCAGCCGACC AG 

>Mouse transcript 1 predicted peptide (796 residues) 

MNQTASVSHHIKCQPSKTIKELGSNSPPQRNWKGIAIALLVILVVCSLITMSVILLTPDELTNSSETRLSLEELL 

GKGFGLHNPEARRINDTWVYKTNNGHVMKLNTESNASTLLLDNSTFVTFKASRHSLSPDLKYVLLAYDVKQ 

IFHYSFTASYLIYNIHTGEWVELNPPEVEDSVLQYAAWGVQGQQLIYIFENNIYYQPDIKSSSLRLTSSGKEGII 

FNGIADWLYEEELLHSHIAHWWSPDGERLAFLMINDSLVPNMIIPRFTGALYPKAKQYPYPKAGQANPSVKL 

YVVNLYGPTHTLELMPPDIFKSREYYITMVKWVSNTRTVVRWLNRPQNISILTLCESTTGACSRKYEMTSDT 

WLSKQNEEPVFSRDGSKFXMTVLVKQGGRGEFHHIAMFLVQSKSEQITVRHLTSGNWEVIRILAYDETTQKI 

YFLSTESSPQGRQLYSASTEGLLNRDCISCNFMKEDCTYFDASFSPMNQHFLLFCEGPKVPWSLHITDNPS 

RYFLLENNSVMKETIQKKKLAKRETRILHIDDYELPLQLSFPKDFMEKNQYALLLIMDEEPGGQMVTDKFHVD 

WDSVLiDTDNVIVARFDGRGSGFQGLKVLQEIHRRIGSVEAKDQVAAVKYLLKQPYIDSKRLSIFGKGYGGYI 

ASMILKSDEKFFKCGAVVAPISDMKLYASAFSERYLGMPSKEESTYQASSVLHNIHGLKEENLLIIHGTADTKV 

HFQHSAELIKHLIKAGVNYTLQVYPDEGYHISDKSKHHFYSTILRFFSDCLKEEVSVLPQEPEEDE* 



>Mouse transcript 2 predicted peptide (789 residues) 

MLKVESRGEAGREELGSNSPPQRNWKGIAIALLVILWCSLITMSVILLTPDELTNSSETRLSLEELLGKGFGL 

HNPEARRINDTVVVYKTNNGHVMKLNTESNASTLLLDNSTFVTFKASRHSLSPDLKYVLLAYDVKQIFHYSFT 

ASYLIYNIHTGEVWELNPPEVEDSVLQYAAWGVQGQQLIYIFENNIYYQPDIKSSSLRLTSSGKEGIIFNGIAD 

WLYEEELLHSHIAHWWSPDGERLAFLMINDSLVPNMIIPRFTGALYPKAKQYPYPKAGQANPSVKLYWNLY 

GPTHTLELMPPDIFKSREYYITMVKWVSNTRTWRWLNRPQNISILTLCESTTGACSRKYEMTSDTWLSKQN 

EEPVFS RDGSKFXMTVLVKQGG RGEFHH IAM FLVQSKSEQITVRHLTSGNW EVIR ILAYDETTQKIYFLSTES 

SPQGRQLYSASTEGLLNRDCISCNFMKEDCTYFDASFSPMNQHFLLFCEGPKVPWSLHITDNPSRYFLLEN 

NSVMKETIQKKKLAKRETRILHIDDYELPLQLSFPKDFMEKNQYALLLIMDEEPGGQMVTDKFHVDWDSVLID 

TDNVIVARFDGRGSGFQGLKVLQEIHRRIGSVEAKDQVAAVKYLLKQPYIDSKRLSIFGKGYGGYIASMILKS 

DEKFFKCGAWAPISDMKLYASAFSERYLGMPSKEESTYQASSVLHNIHGLKEENLLIIHGTADTKVHFQHSA 

ELIKHLIKAGVNYTLQVYPDEGYHISDKSKHHFYSTILRFFSDCLKEEVSVLPQEPEEDE* 

>Mouse transcript 3 predicted peptide (800 residues) 

MTAMKQEQQPTPGARATQSQPADQELGSNSPPQRNWKGIAIALLVILVVCSLITMSVILLTPDELTNSSETRL 

SLEELLGKGFGLHNPEARRINDTVWYKTNNGHVMKLNTESNASTLLLDNSTFVTFKASRHSLSPDLKYVLL 

AYDVKQIFHYSFTASYLIYNIHTGEVWELNPPEVEDSVLQYAAWGVQGQQLIYIFENNIYYQPDIKSSSLRLTS 

SGKEGilFNGIADWLYEEELLHSHIAHWWSPDGERLAFLMINDSLVPNMIIPRFTGALYPKAKQYPYPKAGQA 

NPSVKLYVVNLYGPTHTLELMPPDIFKSREYYITMVKWVSNTRTWRWLNRPQNISILTLCESTTGACSRKYE 

MTSDTWLSKQNEEPVFSRDGSKFXMTVLVKQGGRGEFHHIAMFLVQSKSEQITVRHLTSGNWEVIRILAYD 
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ETTQKIYFLSTESSPQGRQLYSASTEGLLNRDCISCNFMKEDCTYFDASFSPMNQHFLLFCEGPKVPVVSLH 

ITDNPSRYFLLENNSVMKETIQKKKLAKRETRILHIDDYELPLQLSFPKDFMEKNQYALLLIMDEEPGGQMVT 

DKFHVDWDSVLIDTDNVIVARFDGRGSGFQGLKVLQEIHRRIGSVEAKDQVAAVKYLLKQPYIDSKRLSIFGK 

GYGGYIASMILKSDEKFFKCGAVVAPISDMKLYASAFSERYLGMPSKEESTYQASSVLHNIHGLKEENLLIIHG 

TADTKVHFQHSAELIKHLIKAGVNYTLQVYPDEGYHISDKSKHHFYSTILRFFSDCLKEEVSVLPQEPEEDE* 

> Mouse transcript 4 predicted peptide (Plus 800 residues - no start yet identified) 

YTIGLERPPGQVRSHVCLSSSSLRPTPRLSCLIHGPSRNLEQPSALWRQNAQGGEPWGSGQRDELTNSSE 

TRLSLEELLGKGFGLHNPEARRINDTWVYKTNNGHVMKLNTESNASTLLLDNSTFVTFKASRHSLSPDLKY 

VLLAYDVKQIFHYSFTASYLIYNIHTGE>A/VELNPPEVEDSVLQYAAWGVQGQQLIYIFENNIYYQPDIKSSSLR 

LTSSGKEGIIFNGIADWLYEEELLHSHIAHWWSPDGERLAFLMINDSLVPNMIIPRFTGALYPKAKQYPYPKA 

GQANPSVKLYWNLYGPTHTLELMPPDIFKSREYYITMVKWVSNTRTWRWLNRPQNISILTLCESTTGACS 

RKYEMTSDTWLSKQNEEPVFSRDGSKFXMTVLVKQGGRGEFHHIAMFLVQSKSEQITVRHLTSGNWEVIRI 

LAYDETTQK1YFLSTESSPQGRQLYSASTEGLLNRDCISCNFMKEDCTYFDASFSPMNQHFLLFCEGPKVPV 

VSLHITDNPSRYFLLENNSVMKETIQKKKLAKRETRILHIDDYELPLQLSFPKDFMEKNQYALLLIMDEEPGGQ 

MVTDKFHVDWDSVLIDTDNVIVARFDGRGSGFQGLKVLQEIHRRIGSVEAKDQVAAVKYLLKQPYIDSKRLSI 

FGKGYGGYIASMILKSDEKFFKCGAWAPISDMKLYASAFSERYLGMPSKEESTYQASSVLHN1HGLKEENLL 

IIHGTADTKVHFQHSAELIKHLIKAGVNYTLQVYPDEGYHISDKSKHHFYSTILRFFSDCLKEEVSVLPQEPEE 



> Mou se full length transcript 1 sequence 

CTTTCTATCCCATATTGCCAAGAGCTCTGAGAATGATGAACCAAGTAAATGCTTTATTTAGGGCTAAGTG 

AGCTGCGGATCCCTCCAGCCTGACAGCGTTTCAGCCTGCAGAGTAAGACAGCAGCAGCAACAGAAGC 

AGCAGAGGCAGCAGCAACAGCAGCAGCCCCTTGCTCAAGTCCAGCAGGGGAGACTTGATCTCTGGTT 

AGTC7TGGAACTCAGCCTGTGGATCGTGCACTGTCCAGGGTCCTGAAATATGAACCAAACAGCCAGCG 

TGTCCCATCACATCAAGTGTCAGCCCTCCAAGACCATCAAGGAACTAGGAAGTAACAGCCCTCCACAAA 

GAAACTGGAAGGGAATCGCCATTGCCCTGCTGGTGATCTTGGTGGTATGCTCACTCATCACAATGTCTG 

TCATCCTGTTAACACCAGATGAACTAACAAATTCTTCAGAAACAAGACTGTCACTAGAGGAGCTTTTGGG 

GAAAGGATTTGGACTTCATAATCCAGAAGCTCGGAGGATCAATGATACAGTCGTTGTATATAAAACCAA 

CAATGGACACGTCATGAMCTGAATACAGAATCAMTGCTTCCACATTGTTATTGGACAACTCAACTTTT 

GTAACG7TTAMGCCTCCAGACACTCACTTTCCCCAGATTTAAAGTATGTTCTCCTGGCATACGACGTCA 

AGCAGATTTTTCATTACTCATTTACCGCTTCATATTTGATTTACAACATACACACTGGGGAAGTGTGGGA 

GTTGAATCC CCCAG AAGTTGAGGACTCAGTGTTGCAGTATGGGGCCTGGGGTGTGCAAGGACAGCAG 

CTGATTTATATTTTTGAAAACAATATCTACTATCAACCTGATATCAAGAGCAGTTCTTTACGACTAACATC 

TTCAGGAAAGGMGGCATTATTTTTAATGGGATTGCTGACTGGTTATATGAAGAAGAACTTCTGCATTCT 

CATATCGCCCACTGGTGGTCCCCCGATGGAGAGAGACTTGCCTTCCTGATGATAAATGACTCGTTGGT 

GCCTAACATGATCATACCTCGGTTTACTGGAGCACTGTACCCCAAAGCAAAGCAGTACCCATACCCCAA 

GGCAGGTCAAGCGAAC CCATCA GTGAAGTTATATGTTGTAAACCTGTATGGACCAACTCATACTTTGGA 

GCTGATGCCACCCGACATTTTTAAATCAAGAGAATATTACATCACCATGGTTAAATGGGTGAGCAACAC 

GAGGACAGTGGTCAGGTGGTTGAATAGGCCTCAGAACATCTCCATCCTCACACTCTGTGAATCTACCAC 

CGGGGCATGCAGCAGGAAATATGAGATGACGTCCGACACCTGGCTCTCTAAACAGAACGAGGAGCCA 

GTGTTTTCCAGAGATGGAAGCAAGTTCTYCATGACTGTrCTTGTTAAACAAGGTGGAAGAGGAGAATTC 

CACCACATAGCTATGTTCCTGGTCCAGAGTAAAAGTGAGCAAATTACGGTGCGGCATCTGACATCAGGA 

AACTGGGAAGTGATAAGGATCTTGGCCTATGATGAAACAACTCAAAAAATTTACTTTCTGAGCACAGAAT 

CTTCTCCCCAAGGGAGGCAACTATACAGTGCTTCTACTGAGGGACTATTGAATCGTGATTGCATCTCAT 

GCAACTTTATGMAGAAGATTGCACGTATTTTGATGCCAGCTTTAGCCCCATGMTCAGGATTTCTTGTT 

ATTCTGTGAAGGTCCAAAGGTCCCAGTGGTCAGCCTTCACATCACAGACAACCCATCAAGGTATTTCCT 

CTTGGAAAACAATTCTGTGATGAAAGAAACTATTCAGAAGAAGAAGCTCGCAAAGAGAGAGACTAGAAT 

ACTTCACATTGATGACTATGAACTTCCTTTACAGTTGTCCTTTCCCAAAGATTTTATGGAGAAAAACCAGT 

ATGCTCTTCTATTAATAATGGATGAAGAACCAGGAGGCCAAATGGTGACAGATAAGTTCCATGTTGACT 

GGGATTCAGTTCTTATTGACACCGATAATGTCATTGTAGCAAGATTTGATGGCAGAGGAAGTGGATTCC 

AGGGCCTGAAAGTTTTGCAGGAGATTCACAGAAGGATAGGCTCAGTGGAGGCAAAGGACCAAGTAGCT 

GCTGTAAMTATTTACTGAMCAGCCATATATTGACTCCAAMGATTMGCATTTTTGGAAAGGGATATG 

GGGGCTATATCGCATCAATGATCTTAAAATCAGATGAGAAGTTTTTCAAATGTGGAGCCGTGGTTGCAC 

CCATCTCAGACATGAAGTTGTATGCCTCAGCTTTCTCTGAACGGTACCTTGGCATGCCATCAAAGGAAG 

AAAGCACTTACCAGGCATCCAGTGTACTGCATAACATTCATGGTTTAAAAGAAGAAAATTTATTAATAATT 

CACGGAACTGCTGATACAAAAGTTCATTTCCAGCATTCAGCAGAATTGATCAAGCATCTGATAAAAGCT 

GGGGTGAATTACACTCTGCAGGTCTATCCAGATGAAGGATATCACATTTCAGACAAGAGCAAGCATCAT 
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Figure 9. Human DPP10 alternate exons and predicted peptides 
>Buman alternate exon lb 

AGTTAAACAGCCTGGGGCTCAGTGCTGCCATCCGTAAATTGGAGGATTAGAGAAATAATATACACAAGTACCTGGTACA 

TACCAAATATGATAGAAGATATCTTCAACCTTTAGCTATAAGAAGAATGGAAAATGAAAAAGAGACAGGTGGAGAGAGG 

GAGGGAGATGAGAGCTGCGTGGAGCTGGTATGCTGGTTAGGAGGTAGGAAATAATGAGGCCTCGACCATCTAAGATACA 

GAAGTCTCCCTGCTGAATCGAAAGGAAGTGCTACAATATTCTTTCAAATTCATCGCAGTTGAGAGAAAACTGGAGCAGA 

GTGTGATATCAAGAAAGGGCTTCACCAGGATGATCAATGCAAATGGATATGGAGCTTTCTTGGAATGAAACATTATCCC 
GTAACACCAG 

>Human alternate axon lc 

GTATACATACATAAAGAGTGGCTTCAGGACTG 

>Human alternate exon Id 

TTGAAGTACTTCTATGAACAGAAATCTTCCCTTATCAGCTACTTGGTTGAGATACAACAG 
> Human alternate exon le 

TTCTGAAAACTAACTAAGATGCTGGAGAAATTGGGAGACCA 
>Human alternate axon If 

ACQGACTCTGCGGGAAGTTAGAGCCTCTGCGTGCGCTCCGGGGCCCGGCGAGAGGATGCGCAAGGTGGAGAGCCGCGGG 
GAAGGGGGCAGAGAG 

>Human alternate exon lg 

GTTGGTTGACCTTTCTTCCCTTGTACAAAGGATTCTC 

ATAATTTTGATTCTCAATAGCAGTTTTCTGTACTCTAGTCTGTGAGTTACTGGGGAACATTTGAAGGCAAGAAGAATAG 
CAGTTAATACGGCGTGCCACCTTCTCACCATGCACACTGGATTCGCAG 

>Huiaan DPP10 Liver clone 1 (as far as exon 5) 

TCACCATGCACACTGGATTCGCAGGAACTGGGAAGTAACAGCCCTCCACAGAGAAACTGGAAGGGAATTGCTATTGCTC' 
TGCTGGTGATTTTAGTTGTATGCTCACTCATCACTATGTC 

AGAAACC AGATTGTCTTTGGAAGAC CTCTTTAGGAAAGACTTTGTGC TTC AC GATC C AGAGGC TCGGTGGATC AATGGT 
AACCTTCAAAGCATCAAGACATTCAGTTTCACCAGATTTAAAATATGTCCTTCTGGCATATGATGTCAAACAGATTTTT 
CATTATTCGTATACTGCTTCATATGTGATTTACAACATACACACTAG 

>Human DPP10 Liver clone 2 (as far as exon 5) 

TCACCATGCACACTGGATTCGCAGATGAACTCACAAATTCGTCAGAAACCAGATTGTCTTTGGAAGACCTbTOTAGGAA 
AGACTTTGTGCTTCACGATCCAGAGGCTCGGTGGATCAATGATACAGATGTGGTGTATAAAAGCGAGAATGGACATGTC 
ATTAAACTGAATATAGAAACAAATGCTACCACATTATTATTGGAAAACACAACTTTTGTAA 

>Human DPP10 Liver clone 3 (as far as exon 5) 

TCACCATGCACACTGGATTCGCAGGTAGCTGTATGAACTC 

TTTAGGAAAGACTTTGTGCTTCACGATCCAGAGGCTCGGTGGATCAATGATACAGATGTGGTGTATAAAAGCGAGAATG 
GACATGTCATTAAACTGAATATA^AAACAAATGCTACCACATTATTATTGGAAAACACAACTTTTGTAA 

>Human alternate exon 2a sequence (found downstream of mex4/exon la in 3' race 
clone) 



TTACCACTGGGAAGAAAAGTAAAATGCATACAGCCT 
CGGATGGCl l ri I CTTTAGAGGGGCATCCTTMG 

GGTCTTGCGGAAGGCAATTAACCCTGTTAAGGATTTCTCCCCCACTAACAGA^ 

TATCTGCCCAAACAAAAATGCTATGTAAGTGCTITGTATTATGATTTGTTCT 

ACGACATTTTCATAAGGAGTTGTATAGTG^ 

CAATTATTATTAACTATAATGATGATAAAAATTCATTTTGAAGTCACCCGGGGGATG 

AATATTGTAGTAACATCTGCAAGATTTCATCATGAACCTTTACCATGGAGGTAGGGGAAGAAGATTGGAT 

TCAAATGTCATTGAAAGAACTGATTAATTCAGTAAAAAAAATGCCAGTTAGTCATCCCAGTGCT 

CCAAAATGGAAAATACCTGGGTAGCTGCATTGATGGTACATATGGTTGCATTGAGGAGGAATTGTATCAT 

GGAATAAATATAGGCTTCATATGAAAAAAAAAAAAAAAAAAAAA 



> alternate exon 2b 

GAGAACCTGCCCAGAGGAGCCTGCTGGAGGATGAAGGACACGTGGAGCACAGCCATGTAGCCCTTGTCATCTCAGCCGT 
GGCCATCCTGGACCAAATGCTCACCTGCAATGTGTTCATTATACTTGTTGTGGCAATGCATAACCAATATGGTTGCTTT 
GCCTG^ATTTTGTGACCTCTTTGTGAGTGAGACATCTATGTCTTGATTAAGTCTCTTTCCCCCATTCCTAGAACAGTAG 
TTGGTCTATTACATTTTGGTTGAATTAAA 
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> alternate exon 2C 

GTTTTATGGATCCTGTGAGTCTGCTTATCACAGTCATTTG^AGAGAGAAACAACAACTGCCAATCATGAAACAGAGGAT 
GTGTGAGATATGGGCTTGCTGTGAACC^GATGATCCAAAGGGGATGCTGAAAGCTTAGCAGAACCAAATACTCAAATA 
GAAAAAATGTGGCTGATTTGCTTCAACAGCTTTCCGAGATAATATACCAAAGTGGAAACTGATAGCTTCTCCAGATCAT 
ACTTGCTCATTCTGTCCCCGATTCTTGGGCAGAACTCTC 

CAGAAGACCTAAAACAAGACCCATAGAGTAAAGGGACATGGGAAACAAAATGGCAAATCTGACATCAGAAAAAAAAAAA 
AAAAAAAA 

> alternate exon 2D 

GTACTTTGCTGCCTCCTAGTGTTGTGGTGAGGCTCCAGGCTGAGAGAAGTTTCCTCTSAGCTCCAGCTCAGCCAGCACC 

CTTCTGAGTGATGAACACTGTTGCAGACCATAGACCTGAGACAGCTGGCTGCGGTTTGACACAGTTTAGAAGATGCTAC 

TACATCAGAGGCCTCTCTTGTCCATTGGGTCCCTTGTAGTGGACCCCTTTGCTTTACTCCTGAATTGAGTATCAGTAGG 

GCATTTGTGCAGGGCTTTTAAAGACTTCTTTCCTCTCTCAAATACCTCCCTTCTTC 

TTGGAGTTTTTAATTCCCTCTGATCTT^ 

ACATAAATTTTCCAGTTAAAAAAANAAAAAA 

>alternate exon 2E 

GTACTTTGCTGCCTCCTAGTGTTGTGGTGAGGCTC^ 
CTTCTGAGTGATGAACACTGTTGCAGACCATAGACCTO^ 

TACATCAGAGGCCTCTCTTGTCCATTGGGTCCCTTGTAGTGGACCCCTTTGCTTTACTCCTGAATTGAGTATC 

GCATTTGTGCAGGGCTTTTAAAGACTTCTTTCCTCTCTCAAATACCTCCCTTCTTCCCAGTTC 

TTGGAGTTTTTAATTCCCTCTGATCTTAAGGTTA 

ACATAAATTTTCCAGTTAAAAAAANAAAAAA 

> alternate exon 2F 

GATCCATTGCTGTTTACATGGGCCCATTAAATGGAGTTATGACTATTTTC TAG TTTAT 
>alternate exon 2G 

GATCCATTGCTGTTTACATGGGCCCATTAAATGGAGOT^ 
GAACAATTACCTTTAAATATTGTTTCTGACTAAAG 

CTGACACTTCTACACTTAAGTGGACTCTTATCAGCCCAAGTAGGAGGTAAAATTATGGTAATCTAATCAGATATAAGAA 
GCATTTAGATAACCAAATTTCAAATTATTAGACAGCT^ 

ACCATCACTAAAGTCCATGATATGTATCGAGACAGTAGTTATTAAAAGTACATATTTGGGGAATGTGAAAAAAAAAAAA 
AAAAGAAAAAAAAA 

>huDPP10 Transcript 1 predicted peptide (796 residues) 

mQTASVSHHIKCQPSKTIKELGSNSPPQRJSWKGIAIALLVILWCSLITMSVILLTPDE 
LTNS SETRLSLEDLFRKDFWiHDPEARWINDTDVVYKS enghvi klni etnattlllent 
TFWFKASRHSVSPDLKYVLLAYDVKQIFHYSYTASYVTYNIHTREVWELNPPEVEDSVL 
QYAAWGVQGQQLIYIFENNIYYQPDIKSSSLRLTSSGKEEIIFNGIADWLYEEELLHSHI 
AHWWS PDGERLAFLMINDSLVPTMVI PRFTGALYPKGKQ YPYPKAGQVNPTI KLYWNLY 
GPTHTLELMPPDS FKSREYYITMVKWVSNTKTWRWLIJRPQNI S ILTVCETTTGAC SKKY 
EMTSDT1^SQQ1^EPVFSRIX5SKFFMWPVKQGGRGEFHHIAMFLIQSK^ 
GNWEVIKILAYDETTQKIYFLSTESSPRGRQLYSASTEGIjLNRQCISCNFMKEQCTYFDA 
SFSPMNQHFLLFCEGPRVPWSLHSTDNPAKYFILESNSMLKEAILKKKIGKPEIKILHI 
DDYELPLQLSLPKDF1^RNQYALLLIMDEEPGGQLVTDKFHIDV©SVLIDMDNVIVARFD 
- GRGSGFQGLKILQEIHRRLGSVEVKDQITAVKFLLKLPYIDSKRLSIFGKGYGGYIASMI 
LKSDEKLFKCGSWAPITDLKLYASAFSERYLGMPSKEESTYQAASVLHimiGLKEENIL 
IIHGTADTKVHFQHSAELIKHLlKAGVim^ 
LKEEISVLPQEPEEDE* 

>Buman DPP10 Transcript 2 predicted peptide (747 residues) 

MSVILLTPDELTNSSETRLSLEDLFRKDFVLHDPEARWim)TDVVYKSENGHVIKLNIET 
NATTLLLENTTFVTFKASRHSVSPDLKYVLLAYDVKQIFHYSYTASWIYNIHTREVWE^ 
NPPEVEDSVLQYAAWGVQGQQLIYIFENNIYYQPDIKSSSLRLTSSGKEEIIFNGIADWL 
YEEELLHSHI AHWWS PDGERLAFLMINDSLVPTMVI PRFTGAL YPKGKQYP YPKAGQVNP 
TIKLYVVl^YGPTHTLELMPPDSFKSREYYITMVKWVSOT 

TTTGACSKKYEMTSDTWLSQQNEEPVFSRDGSKFFMTVPVKQGGRGEFHHIAMFLIQSKS 
EQITVRHLTSGNWEVIKILAYDETTQKIYFLSTESSPRGRQLYSASTEGLLNRQCISCNF 
MKEQCTYFDASFSPMNQHFLLFCEGPRVPVVSLHSTDNPAKYFILESNSMLKEAILKKKI 
GKPEIKILHIDDYELPLQLSLPKDFMDRNQYALLLIMDEEPGGQLVTDKFHIDWDSVLID 
MDNVIVARFDGRGSGFQGLKILQEIHRRLGSVEVKDQITAVKFLLKLPYIDSKRLSIFGK 
GYGGYIASMILKSDEKLFKCGSWAPITDLKLYASAFSERYLGMPSKEESTYQAASVLHN 
VHGLKEENILI IHGTADTKVHFQHS AEL I KHL I KAGVNYTMQVYPDEGHNVS EKSKYHLY 
STILKFFSDCLKEEISVLPQEPEEDE* 
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>Human DPPIO Transcript 3 predicted peptide (747 residues) non-synonymous SNP 
underlined 

MSGILLTPDELTNSSETRLSLEDLFRKDFVLHDPEARWIIOTTDWYKSENG 

NATTLLLENTTFVTFKASRHSVSPDLKYVLLAYDVKQIFHYSYTASYVIYNIHTREVWEL 

NPPEVEDSVLQYAAWGVQGQQLIYIFENNIYYQPDIKSSSLRLTSSGKEEIIFNGIADWL 

YEEELLHSHIAHWWSPDGERLAFLMINDSLVPTMVIPRFTGALYPKGKQYPYPKAGQVNP 

TIKLYVV^YGPTHTLELMPPDSFKSREYYrTWKWSOTKTVWW^ 

TTTGACSKKYEMTSDTWjSQQI^EPVFSRDGSKFFMTVPVKQGGRGEFHHIAMFLIQSKS 

EQITVRHLTSGNWEVIKILAYDETTQKIYFLSTESSPRGRQLYSASTEGLLNRQCISCNF 

MKEQCTYFDASFSPMNQHFLLFCEGPRVPWSLHSTDNPAKYFILESNSMLKEAILKKKI 

GKPEIKILHIDDYELPLQLSLPKDFMDRNQYALLLIMDEEPGGQLVTDKFHIDWDSVLID 

MDNVIVARFDGRGSGFQGLKILQEIHRRLGSVEVKDQITAVKFLLKLPYIDSKRLSIFGK 

GYGGYIASMILKSDEKLFKCGSWAPITDLKLYASAFSERYLGMPSKEESTYQAASVLHN 

VHGLKEENILIIHGTADTKVHFQHSAELIKHLIKAGVOTTMQVYPDEGHWSEKSKYHLY 

STILKFFSDCLKEEI SVLPQEPEEDE * 

>Human DPPIO Transcript 4 predicted peptide (789 residues) 

MRKVESRGEGGREELGSNSPPQRNWKGIAIALLVILWCSLITMSVIIiLTPDELTNSSET 

RliSLEDbFRKDFVLHDPEARWIOTTDVVYKSENGHVIKLNIETNATTLL 

SRHSVSPDLKYVLLAYDVKQIFHYSYTASWIYNIHTREVWELNPPI^^ 

QGQQLIYIFENNIYYQPDIKSSSLRLTSSGKEEIIFNGIADWLYEEELLHSHIAHWWSPD 

GERLAFLMINDSLVPTMVIPRFTGALYPKGKQYPYPKAGQVNPTIKLYV^ 

LMPPDSFKSREYYITMVKWVSNTK07VVRWL.NRPQNI S ILTVC ETTTGACSKKYEMTSDTO 

LSQQNEEPWSRDGSKFFMWPVKQGGRGEFHHIAMFLIQSKSEQITVRHLTSGNWEVIK 

ILAYDETTQKI YFLSTES S PRGRQLYS ASTEGLLNRQC I SCNFMKEQCTYFDASFSPMNQ 

HFLLFCEGPRVPWSLHSTDNPAKYFILESNSMLKEAILKKKIGKPEIKILHIDDYELPL 

QLSLPKDFMDRNQYALLLIMDEEPGGQLVTDKFHIDWDSVLIDMDNVIVARFDGRGSGFQ 

GLKILQEIHRRLGSVEVKDQITAVKFLLKLPYIDSKRLSIFGKGYGGYIASMILKSDEKL 

FKCGSVVAPITDLKLYASAFSERYLGl^SKEESTYQAASVLHIWHGLKEENILI IHGTAD 

TKVHFQHSAELIKHLIKAGVlSnfTMQVYPDEGHWSEKSKYHLYSTILKF 

LPQEPEEDE* 

>Human DPPIO Transcript 5 predicted peptide (747 residues) 

MSVIIiIiTPDELTNSSETRLSLEDLFRKDFVLHDPEARWINDTDWYKSENGHVIKLNIET 
NATTLLLENTTFVTFKASRHS VSPDLKYVLLAYDVKQ I FHYS YTAS YVI YNIHTREVWEL 
NPPEVEDSVIiQYAAWGVQGQQLIYIFENNIYYQPDlKSSSLRLTSSGKEEIIFNGIADWL 
YEEELLHSH IAHWWSPDGERLAFLMINDSLVPTMVX PRFTGALYPKGKQYPY PKAGQVNP 
TIKLYVVNLYGPTHTLELMPPDSFKSREYYITMVK^ 

TTTGACSKKYEMTSDTWLSQQNEEPVFSRDGSKFFMTVPVKQGGRGEFHHIA^^ 

EQITVRHLTSGNWEVIKILAYDETTQKIYFLSTESSPRGRQLYSASTEGriLNRQCISCNF 

MKEQCTYFDASFSPMNQHFLLFCEGPRVPWSLHSTDNPAKYFILESNSMLKEAILKKKI 

GKPEIKILHIDDYELPLQLSLPKDFMDRNQYALLLIMDEEPGGQLVTDKFHIDWDSVLID 

MDI^VARFDGRGSGFQGLKILQEIHRRLGSVEVKDQITAVl^LLKLPYIDSKRLSIFGK 

GYGGYIASMILKSDEKLFKCGSWAPITDLKLYASAFSERYLGMPSKEESTYQAASVLHN 

VHGLKEEOTLIIHGTADTKVHFQHSAELIKHLIKAGVNYTMQVYPDEGHNVSEKSKYHLY 

STILKFFSDCLKEEI SVLPQEPEEDE* 

>Huraan DPPIO Transcript 6 predicted peptide (47 residues) 

MNQTASV SHHI KCQPSKTI KLPLGRKVKC I QPALLPANGYKADRMAFSLEGHP * 

>Exon lb 3' RACE clone transcript 1 predicted peptide 

MQMDMELSWN ETLSRNTRYT YMKSGFRTGE PAQRSLLEDE GHVEHSHVAL 
VISAVAILDQ MLTCNVFIIL WAMHNQYGC FACIL* 

>Exon lb 3' RACE clone transcript 2 predicted peptide 

MQMDMELSWN ETLSRNTRYT YMKSGFRTGF MDPVSLLITV I* 

>Exon lb 3' RACE clone transcript 3 predicted peptide 

MQMDMELSWN ETLSRNTRYT YMKSGFRTGT LLPPSVWRL QAERSFL* 

>Exon lb 3' RACE clone transcript 4 predicted peptide 

MQMDMELSWN ETLSRNTRYT YMKSGFRTGS IAVYMGPLNG VMTIF* 

>Exon lb 3 ' RACE clone transcript 5 predicted peptide 

MQMDMELSWN ETLSRNTRYT YMKSGFRTGS IAVYMGPLNG VMTIF* 
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>Human transcript 6 (Mex4-6 "stopper") 

GTCCCATCACATCAAGTGTCAACCCTCAAAAACAATCAAGTTACCACTGGGAAGAAAAGTAAAATGCAT 

ACAGCCTGCCCTACTACCTGCAAACGGATATAAGGCAGATCGGATGGCTTTTTCTTTAGAGGGGCATCC 

TTAAGTGGAGTTCTCACTCTCCCTTTACCACATGAACCATGTGGTCTTGCGGAAGGCAATTMCCCTGT 

TAAGGATTTCTCCCCCACTAACAGAAGGGCAATAAACTTAAATTATCTGCCCAAACAAAAATGCTATGTA 

AGTGCTTTGT ATTAT GATTTGTTCTAGTAGCMTTTTTAAGGATACGACATTTTCATAAGGAGTTGTATAG 

TGAGMTCAATTTTTATGACATAAAAGCTGAGMTTTTATTTGCMTTATTATTAACTATAATGATGATAAA 

MTTCATTTTGMGTCACCCGGGGGATGAAMTTATCTTCTAATATTGTAGTAACATCTGCAAGATTTCAT 

CATGMCCTTTACCATGGAGGTAGGGGAAGAAGATTGGATTCAAATGTCATTGAAAGAACTGATTAATT 

CAGTAAAAAAAATGCCAGTTAGTCATCCCAGTGCTGTGCTACCAAAATGGAAAATACCTGGGTAGCTGC 

ATTGATGGTACATATGGTTGCATTGAGGAGGMTTGTATCATGGAATAAATATAGGCTTCATATGAAAAA 
AAAAAAAAAAAAAAAA 

>human 3' RACE clone 

TGAGATGACATCAGATACGTGGCTCTCTCAGCAGAATGAGGAGCCCGTGTTTTCTAGAGACGGCAGCA 
AATTCTTTATGACAGTGCCTGTTAAGCAAGGGGGACGTGGAGAATTTCACCACATAGCTATGTTCCTCA 
TCCAGAGTAAAAGTGAGCAAATTACCGTGCGGCATCTGACATCAGGAAACTGGGAAGTGATAAAGATCT 
TGGCATACGATGAAACTACTCAAAAAATTTACTTTCTGAGCACTGAATCTTCTCCCAGAGGAAGGCAGC 
TGTACAGTGCTTCTACTGAAGGATTATTGAATCGCCMTGCATTTCATGTAATTTCATGAAAGAACAATG 
TACATATTTTGATGCCAGTTTTAGTCCCATGMTCMCATTTCTTATTATTCTGTGAAGGTCCAAGGGTCC 
CAGTGGTCAGCCTACATAGTACGGACAACCCAGCAAAATATTTTATATTGGAAAGCAATTCTATGCTGAA 
GGAAGCTATCCTGAAGAAGAAGATAGGAAAGCCAGAAATTAAAATCCTTCATA7TGACGACTATGAACTT 
CCTTTACAGTTGTCCCTTCCCAAAGATTTTATGGACCGAAACCAGTATGCTCTTCTGTTAATAATGGATG 
AAGAACCAGGAGGCCAGCTGGTTACAGATAAGTTCCATATTGACTGGGATTCCGTACTCATTGACATGG 
ATAATGTCATTGTAGCAAGATTTGATGGCAGAGGAAGTGGATTCCAGGGTCTGAAAATTTTGCAGGAGA 
TTCATCGAAGATTAGGTTCAGTAGA AGTAAA GGACCAAATAACAGCTGTGAAATTTTTGCTGAAACTGCC 
TTACATTGACTCCAAA AGATTA AGCATTTTTGGAAAGGGTTATGGTGGCTATATTGCATCAATGATCTTA 
AAATCAGATGAAMGCTTTTTAAATGTGGATCCGTGGTTGCACCTATCACAGACTTGAMTTGTATGCCT 
CAGCTTTCTCTGAAAGATACCTTGGGATGCCATCTAAGGAAGAAAGCACTTACCAGGCAGCCAGTGTG 
CTACATAATGTTCATGGCTTGAAAGAAGAAAATATATTAATAATTCATGGAACTGCTGACACAAAAGTTC 
ATTTCCAACACTCAGCAGMTTAATCAAGCACCTAATAAAAGCTGGAGTGAATTATACTATGCAGGTCTA 
CCCAGATGAAGGTCATAACGTATCTGAGAAGAGCAAGTATCATCTCTACAGCACAATCCTCAAATTCTTC 
. AGTGATTGTTTGAAGGAAGAAATATCTGTGCTACCACAGGAACCAGAAGAAGATGAATAATGGACCGTA 
TTTATACAGAACTGAAGGGAATATTGAGGCTCAATGAAACCTGACAAAGAGACTGTAATATTGTAGTTGC 
TCCAGAATGTCAAGGGCAGCTTACGGAGATGTCACTGGAGCAGCACGCTCAGAGACAGTGAACTAGCA 
TTTGAATACACAAGTCCAAGTCTACTGTGTTGCTAGGGGTGCAGAACCCGTTTCTTTGTATGAGAGAGG 
TCAMGGGTTGGTTTCCTGGGAGAAATTAGTTTTGCATTAAAGTAGGAGTAGTGCATGTTTTCTTCTGTT 
ATCCCCCTGTTTGTTCTGTAACTAGTTGCTCTCATTTTAATTTCACTGGCCACCATCATCTTTGCATATAA 
TGCACAATCTATCATCTGTCCTACAGTCCCTGATCTTTCATGGCTGAGCTGCAATCTAACACTTTACTGT 
ACCTTTATAATAAGTG CAATTCTTTCATTGTCTATTATTATG CTTAAG AAAATATTCAGTTAATAAAAAAC A 

GAGTATTTTATGTAATTTCTGTTTTTAAAAAGACATTATTAAATGGGTCAAAGGACATATAGAAATGTGGA 

TTTCAGCACCTTCCAAAGTTCAGCCAGTTATCAGTAGATACAATATCTTTAAATGAACACACGAGTGTAT 

GTCTCACAATATATATACACAAGTGTGCATATACAGTTAATGAAACTATCTTTAAATGTTATTCATGCTAT 

AMGAGTAMCGTTTGAT GMTTA GMGAGATGCTCTTTTCCAAGCTATAATGGATGCTTTGTTTAATGA 

GCCAMTATGATGAAACATTTmCCMTTCAMTTCTAGCTATTGCm 

TTTGGTATTGT1TTTAGTGGTTMTAGTTTTCCAGTTGCATTTAATTTTTTGMTATGATACCTTGTCACAT 
GTAMTTAGATACTTAAATATTAAATTATAGTTTCTGATAAAMAAAAAAAAAAAAAAAAAA 
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Figure 10. Sequences of the DPP10 exons. The 5' UTR is included in exon 1 and the 3' UTR 
sequence in exon 25. The coding regions of these two exons are In bold. 



>5 r UTR/exonl 

CTCCGGAGTGAGGAAGCAGCAGAAACAGAAGCAGCAGAAGCAACAGCAGTAGCAGCGGCAG 
CAGCAACAGCAGCAGCCCCTACTGAAGTCCAATAGAGGAGACTTGATCTCTAGTTCATTCTGG 
AACTCCGCCTGGGATTGTGCACTGTCCAGGGTCCTGAAACATGAACCAAACTGCCAGCGTGT 
CCCATCACATCAAGTGTCAACCCTCAAAAACAATCAAG 

>exon 2 

GAACTGGGAAGTAACAGCCCTCCACAGAGAAACTGGAAGGGAATTGCTATTGKTTCTGCTGGT 
GATTTTAGTTGTATGCTCACTCATCACTATGTCAGTCATCCTCTTAACCCCAG 

>exon 3 

ATGAACTCACAAATTCGTCAGAAACCAGATTGTCnTTC 
TGCTTCACGATCCAGAGGCTCGGTGGATCAATG 

>exon*4 

ATACAGATGTGGTGTATAAAAGCGAGAATGGACATGTCATTAAACTGAATATAGAAACAAAT 
GCTACCACATTATTATTGGAAAACACAACTTTTGTAA 

>exon 5 

CCTTCAAAGCATCAAGACAITCAGTTTCACCAGATTTAAAATATGTCCTTCTGGCATATGATG^ 
CAAACAGATTTTTCATTATTCGTATACTGCTTCATATGTGATTTACAACATACACACTAG 

>exon 6 

GGAAGTTTGGGAGTTAAATCCTCCAGAAGTAGAGGACTCCGTCTTGCAGTACGCGGCCTGGG 
GTGTCCAAGGGCAGCAGCTG 

>exon7 

ATTTATATTTTTGAAAATAATATCTACTATCAACCTGATATA^ 
CATCTTCTGGAAAAGAAGAAATAATTTTTAATGGGATTGCTGACTGGTTATATGAAG 

>exon8 

AGGAACTCCTGCATTCTCAC^^ 

TGATGATAAATGACTCnTTGGTACCCACCATGGTTATCCCTCGGTTTACTGGAGCGTTGTATCC 
CAAAGGAAAGCAGTATCCGTATCCTAAG 

>exon 9 

GCAGGTCAAGTGAACCCAACAATAAAATTATATGTTGTAAACCTGTATGGACCAACTCACACT 
TTGGAGCTCATGCCACCTGACAGCTTTAAATCAAG 

>exon 10 

AGAATACTATATCACTATGGTTAAATGGGTAAGCAATACCAAGACTGTGGTAAGATGGTTAA 
ACCGAGCTCAGAACATCTCCATCCTCACAGTCTGTGAGACCACTACAGGTGCT 

>exonll 

AAATATGAGATGACATCAGATACGTGGCTCTCTCAGCAG 
>exon 12 

AATGAGGAGCCCGTGTTTTCTAGAGACGGCAGCAAATTCTTTATGACAGTG(Xn , GTTAAGCAA 
GGGGGACGTGGAGAATTTCACCACGTAGCTATGTTCCTCATCCAG 
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>exon 13 

AGTAAAAGTGAGCAAATTACCGTGCGGCATCTGACATCAGGAAACTGGGAAGTGATAAAGAT 
CTTGGCATACGATGAAACTACTCAAAAAAT 

>exon 14 

TTACTTTCTGAGCACTGAATCTTCTCCCAGAGGAAGGCAGCTGTA 
>exon 15 

TGCTTCTACTGAAGGATTATTGAATCGCCAATGCATTTCATGTAAT1TCATGAAAGAACAATG 
TACATATTTTGATGCCAGTTTTAGTCCCATGAATCAACATTTCTTATTATTCTG 

>exon 16 

GTCCAAGGGTCCCAGTGGTCAGCCTACATAGTACGGACAACCCAGCAA 
>exon 17 

AATATTTTATATTGGAAAGCAATTCTATGGTGAAGGAAGCTATCCTGAAGAAGAAGATAGGA 
AAGCCAGAAATTAAAATCCTTCATATTGACGACTATG 

>exon 18 

AACTTCCmACAGTTGTCCCTrCCCAAAGATTTTATGGACCGAAACCAGTAT 
AATAAT 



>exon 19 

GGATGAAGAACCAGGAGGCCAGCTGGTTACAGATAAGTTCCATATTGACTGGGATTCCGTACT 

CATTGACATGGATAATGTCATTGTAGCAAGATTTGATGGCAGAGGAAGTGGATTCCAGGGTCT 

GAAAATTTTGCAGGAGATTCATCGAAGATTAGGTTCAGTAGAAGTAAAGGACCAAATAACAG 
CTGTGAA 



>exon 20 

ATTTTTGCTGAAACTGCCTTACATTGACTCCAAAAGATTAAGC^ 
>exon21 

GGTTATGGTGGCTATATTGCATCAATGATCTTAAAATCAGATGAAAAGCTTTTTAAATGTGGA 
TCCGTGGTTGCACCTATCACAGACTTGAAATTGTATG 

>exon 22 

CCTCAGCTTTCTCTGAAAGATACCTTGGGATGCCATCTAAGGAAGAAA 
>exon23. 

GCAGCCAGTGTGCTACATAATGTTCATGGCTTGAAAGAAGAAAATATATTAATAATTCATGGA 
ACTGCTGACA 

>exon 24 

CAAAAGTTCATTTCCAACACTCAGCAGAATTAATCAAGCACCTAATAAAAGCTGGAGTGAATT 
ATACTATGCAG 



>exon 25 

GTCTACCCAGATGAAGGTCATAACGTATCTGAGAAGAGCAAGTATCATCTCTACAGCAC 
AATCCTCAAATTCTTCAGTGATTGTTTGAAGGAAGAAATATCTGTGCTACCACAGGAACC 
. AGAAGAAGATGAuATAATGGACCGTATTTATACAGAACTGAAGGGAATATTGAGGCTCAATG 
AAACCTGACAAAGAGACTGTAATATTGTAGTTGCTCCAGAATGTCAAGGGCAGCTTACGGAG 
ATGTCACTGGAGCAGCACGCTCAGAGACAGTGAACTAGCATTTGAATACACAAGTCCAAGTC 
TACTGTGTTGCTAGGGGTGKTAGAACCCGm 

CTGGGAGAAATTAGTTTTGCATTAAAGTAGGAGTAGTGCATGTTTTCTTCTGTTATCCCCCTGT 
TTGTTCTGTAACTAGTTGCTCT^^ 

CAATCTATCATCTGTCCTACAGTCCCTGATCTTTCATGGCTGAGCTGCAATCTAA^ 
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GTACCTTTATAATAAGTGCAATTCTrTCATTGrCTATTATTGTGCTTAAGAAAATATTCAGTTA 

ATAAAAAACAGAGTATTTTATGTAATTTCTGTTTTTAAAAAGACATTATTAAATGGGTCAAAG 

GACATATAGAAATGTC3GATTTCAGCACCTTCCAAAGTTCAGCCAGTTATCAGTAGATACAATA 

TCTTTAAATGAACACACGAGTGTATGTCTCACAATATATATACACAAGTGTGCATATACAGTT 

AATGAAACTATCTTTAAATGTTATTCATGCTATAAAGAGTAAACGTTTGATGAATTAGAAGAG 

ATGCTCTTTTCCAAGCTATAATGGATGCTTTGTTTAATGAGCCAAATATGATGAAACATTTTTT 

CCAATTCAAATTCrAGCTATTGCTTTCCTATAAATGTTTGGGTTGTGTTTGGTATTGTTTTTAGT 

GGTrAATAGTTTTCCAGTTGCATTTAATTTTTTGAATATGATACOT 

rACTTAAATATTAAATTATAGTYTCTGATAAAGAAATTTTGTTAACAATGCAA 
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Figure 11. Annotated DPP10 transcript 1 predicted protein sequence -796 amino acids 



MNQTASVSHH IKCQPSKTIK ELGSNSPPQR NWKG 



51 ^^gpTPDE LTNSSETRLS LEDLFRKDFV LHDPEARWIN" DTDWYKSEN 

101 GHVIKLNIET NATTLLLENT TFVTFKASRH SVSPDLKYVL LAYDVKQIFH 

151 YSYTASYVIY NIHTREVWEL NPPEVEDSVL QYAAWGVQGQ QLIYIFENNI 

201 YYQPDIKSSS LRLTSSGKEE IIENGIAflH ^&LHSHI AHWWSPDGER 

251 LAFLMlgDSL VPTMVIPRFT GALYPKGKQY PYPKAGQVNP TIKLYWNLY 

301 GPTHTLELMP PDSFKSREYY ITMVKWVSNT KTWRWLNRP QglSILTVCE 

351 TTTGACSKKY EMTSDTWLSQ QNEEPVFSRD GSKFFMTVPV KQGGRGEFHH 

401 IAMFLIQSKS EQITVRHLTS GNWEVIKILA YDETTQKIYF LSTESSPRGR 

451 QLYSASTEGL LNRQCISCNF MKEQCTYFDA SFSPMNQHFL LFCEGPRVPV 

501 VSLHSTDNPA KYFILESNSM LKEAILKKKI GKPEIKILHI DDYELPLQLS 

551 LPKDFMDRNQ YALLLIMDEE PGGQLVTDKF HIDWDSVLID MDNVTVARFD 

601 GRGSGFQGLK ILQEIHRRLG SVEVKDQITA VKFLLKLPYI DSKRLSIFGK 

651 @YGGYIASMI LKSDEKLFKC GSWAPITDL KLYASAFSER YLGMPSKEES 

701 TYQAASVLHN VHGLKEENIL IIHGTAjjjTKV HFQHSAELIK HLIKAGVfYT 
751 MQVYPDEGgjg VSEKSKYHLY STILKFFSDC LKEEISVLPQ EPEEDE* 



Catalytic triad: 
Cytoplasmic domain: 1-34 
Transmembrane domain: 35-56 
N-linked O-Glycosylation sites: 



MNQTASVSHHIKCQPSKTIKELQSNSPPQRNWKG 



a— 




p-propeller domain: 



228-234 



i 
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Figure 12. Annotated amino acid DPP10 sequence; Amino acids 34-54 (underlined) are predicted 
to traverse the membrane. Repeat sequences within 3 propellers are shown in bold italics. Residues 
homologous to catalytic residues of DPP4 are underlined and shaded. 



I^QTASVSHHIKCQPSKTIKELGSNSPPORNW KGIAIALLVILVVCSLITMSVILL TPDR 
LTNS S ETRL SLEDLFRKDFVLHDPEARWINDTDVVYKSENGHVI KLNI BTNATThLLENT 
TFVTFKASRHSVSPDLKYVLLAYDVKQIFITX'SYTASYVXYNIH 

QYAAWQVQGQQLIYIFENNXYYQPDIKSSSI&LTSSGKER11 FNGI «?■ W«2r«te* Wr-I 
«\ ':'<•>.,:> U V, U- nv^&-<'Z'iW.\ V^'"\\ ^ * .* PKGKQYPYPKAGQVNPTIKLYWNLY 
GPTHTLELMPPDSFKSREYYITMVKWSNTKTWRWLNRPQNISILWCETTTGACSKKY 
EMTSDTWLSQQNEEPVFSRDGSKFFMTVPVKQGGRGEFHHIAMFLIQSKSEQITVRHLTS 
GNWEVIKILAYDETTQKIYFLSTESSPRGRQLYSASTO 

SFSPMNQHFLLFCEGPRVPWSLHSTDNPAKYFILESNSML (end of regulatory domain) 

KEAILKKKIGKPEIKILHIDDYELPLQLSLPKDFMDRNQYALLLIMDEEPGGQLVTDKFH 

IDWDSVLID^WIVARFDGRGSGFQGLKILQEIHRRLGSVEVKDQITAVKFLLKLPYID 

SKRLSIFGI^YGGYIASMILKSDEKLFKCGSWAPITDLKLYASAFSERYLGMPSKEEST 

YQAASVLHNVHGLKEENILIIHGTA^ 

SEKSKYHLYSTILKFFSDCLKEEISVLPQEPEEDE 



t 
i 



wo 02/086113 A) JjUfpcTGiwoisy 

. 187/219 



mojjbiai euog 
puB|Q |euaipv * !': 
B9L|OBJl '.»••' 

PJOQ |BUjdS :i 

pjOjAgi * 



mm ^ 



in 

CM 



CM 
O 



uo|oq . 
eu!}S9iU| puis >. 

sntuAiii > 



mm 



CM 



m 

CM 



CM 



seaiouBd 




<1 



BJU90B|d 
U|Bjg| 



-A- 



O 
CO 

c 

CO 



CL 
CL 
Q 
0) 

sz 
*5 

CO 

"co 

05 

c 

CO 

o 
c 

i— 
0) 

x: 
tr 
o 
z 

CO 

o 

v. 

O) 
UL 



mm -^t 



WO 02/086113 ! 



188/219 



Figure 14 

Multiple alignment of the catalytic domains of DpplO homologues 

Red asterisks (*) mark the catalytic site positions. 
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DpplO 
CG9 059 
DFP6_HUMAN 
DPP4_HUMAN 
FAPa 

Consensus/ 80% 
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KGLDN — R^^TKffiiKHRVSAiBEEQQ^jJ 
jPTPEDl^D^!U^STffiplRAEl^KQVE& 

JPTKDDI^E^KNSTg^RAEY^B 

Pi *sacbYsSsa *ERYbGL . s . -p . . chYp . tsVhh+spsb+p . pbLl IHGTADspVHFQ 
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Figure 16. Multiple alignment of the transmembrane region of DPP10 and 
homologies. 



DPP10_HUMAN 

DPP6JHUMAN 
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Figure 18. Stereo representation of the structure of Prolyl oligopeptidase. 
(Fulop, V., Bocskei, Z. & Polgar, L. Prolyl oligopeptidase: an unusual beta- 
propeller domain regulates proteolysis. Cell 94, 161-70 (1998).) 
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Figure 19 

Cytokines containing PxS motifs 

Human chemokines and cytokines that have a serine within 10 amino acids of a 
predicted signal peptide cleavage site and which contain a PxS motif (where V 
represents any amino acid). The predicted signal peptide cleavage bond is shown as 
" A " and PxS motifs are highlighted. The first group of molecules contain the PxS two 
residues after the cleavage site. 



capasa A slYSSDTTPC 
lclsdg A I<B^S|LSYRCP 
spggla A GJSR^VPTTCC 

isgigg A v?|SRTVRCT 

spqgla A Q|»I)^V r SIPIT 
iflglg A Q|i|^PKSKRK 
alvtns A A§jpSpSTKKT 
gp 1 as a A Gpy £f AVLTEL 
1 1 1 vr a A |QTTTAA 

pedvds A KSMQV*J^RC 
isaalc A KGAVLiI#AK 
pdp t aa A FLL^gjgTACC 



RANTES 

SDF-1 

EOTAXIN 

IP10 

MCP-2 

IL17JS 

IL2 

GCP-2 

IL18 BP 

CHEMOKINE CC-l/CC-3 

INTERLEUKIN-8 

CTACK/ALP/ILC 
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Figure 21. Clustal alignment of mouse and human transcript 1 peptides. 



Human 
Mouse 



Human 
Mouse 



MNQTASVSHHIKCQPSKTIKELGSNSPPQRNWKGIAIALLVILWCSLIT24SVILLTPDE 

MNQTASVSHHIKCQPSKTIKELGSNSPPQRNWKGIAIALLVILWCSLITMSVILLTPDE 
*********************************************************^ # 

LTNSSETRLSLEDLFRKDFVLHDPEARWINOTDVVYKSENGHVIKLNIETNATTLLLENT 
LTNSSETRLSLEELLGKGFGLHNPEARRI^TVVVTO 

************.*. *^* **.**** **** ****..****.*** *.**.****.*. 



Human 
Mouse 



Human 
Mouse 



Human 
Mouse 



Human 
Mouse 



TFVTFKASRHSV S PDLKYVLL AYDVKQI FH YS YTAS YVT YNI HTREVWELNP PEVED S VL 
TFVTFKASRHSLSPDLKYVLLAYDVKQIFHYSFTASYLIYNIHTGEVWELNPPEVEDSVL 

***********. ******************** .**** . ****** *************** 



QYAAWGVQGQQLIYIFENNIYYQPDIKSSSLRLTSSGKEGIIFNGIADWLYEEELLHSHI 
*************************************** ******************** 

AHWW S PDGBRLAFLMINDSLVPTMVI PRFTGAL YPKGKQ YPYPKAGQVNPTIKL YWNI/Y 
AHWWS PDGERLAFLMINDSLVPNMl iprftgalypkakqypypkagqanpsvklywnly 
***********************.*********** ********** ** . .******** 

GPTHTLELMPPDSFKSREYYITMVKWSNTKTVTO 

GPTHTLEI^PDIFKSREYYITMVKWSM'RTVVRV^NRPQNISILTI^ESTTGAGSRKY 
************ *****************.****************.**.******.** 



Human 
Mouse 



EMTSDTWLS QQNEE PVF SRDGSKFFMTVPVKQGGRGEFHH IAMFL I QSKS EQ I TVRHLTS 

EMTSDTWLSKQNEEPWSRDGSKFXMTVLVKQGGRGEFHHIAMFLVQSKSEQITVRHLTS 
*********.************** *** ****************.************** 



Human 
Mouse 



GNWEVT K I LAYDETTQK I YFL S TES SPRGRQLYSASTEGLLNRQC I SCNFMKEQC T YFDA 

GNWEVIRILAYDETTQKIYFLSTES SPQGRQLYSASTEGLLNRDC I SCNFMKEDCTYFDA 
******.********************. ***************.*********.****** 



Human 
Mouse 



SFSPMNQHFLLFCEGPRVPWSLHSTDNPAKYFILESNSMLKEAILKKKIGKPEIKILHI 

SFSPMNQHFLLFCEGPKVPWSLHITDNPSRYFLLENNSVl^ETIQKKKLAKRETRILHI 
****************.******* ***'*..**.** **..**.* * * .**** 



Human 
Mouse 



Human 
Mouse 



DDYELPLQLSLPKDFMDRNQYALLLIMDEEPGGQLVTDKFHIDVTOSVlilDMDNVIVARFD 
DDYELPLQLSF PKDFMEKNQYALLL IMDEE PGGQMVTDKFHVDWDSVL IDTDNVI VARFD 
**********.*****. .****************.******.******** ********* 

GRGSGFQGLKILQEIHRRLGS VEVKDQI TAVKFLLKLPYIDSKRLS I FGKGYGGYIASMI 
GRGSGFQGIJCVLQE IHRR IGS VEAKDQVAAVKYLLKQPYIDSKRL S I FGKGYGGYIASMI 
********** . *******.**** m *** . .***.**•* *********************** 



Human 
Mouse 



LKSDEKLFKCG SVVAPI TDLKLYAS AFS ERYLGMPSKEES TYQ AASVLHNVHGLKEENIL 
LKSDEKFFKCGAWAPI SDMKL YASAFSERYLGMPSKEES TYQAS SVLHNIHGLKEENLL 
******:****.*****»*.************************.*************.* 



Human 
Mouse 



1 1 HGTADTKVHFQHS AEL IKHL I KAGVNYTMQVYPDEGHNVS EK SK YHL YS T I LKFF SD C 

IIHGTADTKVHFQHSAELIKHLIKAGVNYTLQVYPDEGYHISDKSKHHFYSTILRFFSTC 
****************************** »*******...*.***.*.*****.***** 



Human 
Mouse 



LKEEISVLPQEPEEDE 
LKEEVSVLPQEPEEDE 
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Figure 22a 

Linkage disequilibrium within the asthma locus 

The axis of the region runs along the diagonal from the bottom left to the top 
right the figure. Multiple pairwise comparisons between markers are shown, 
with pair-wise D' values for LD colour coded and plotted at the marker 
locations. Bright red and dark blue are opposite ends of the scale, with bright 
red indicating the most significant LD. The plot was completed by 
interpolation, using the program GOLD. The scale bar indicates a distance of 
200Kb 



Figure 22b 

Location of DPP10 exons within the LD map 

The disposition of the initial exons of DPP10 are shown relative to the LD 
map. Significant association to the LnlgE and asthma is indicated by arrows 
above the exons. The scale bar indicates a distance of 50Kb. The peak of 
association to asthma is adjacent to the 2a (stopper) exon. 
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FIGURE 26 Example of Pyrosequencing expected programs for SNP genotyping 



PSQ 



5' 3' 
AATTCTCGTTAAAATATGT 



GGAGOTCCTATrOATTAAGAGCAATrn<\TACARTCAGGAAACTTATCTTTP- 



mm 1 

mm ° 



-GATGTClTrGAAGGTCGAACC 
5 1 
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Figure 27 

Western blot of diluted human serum samples probed with anti-Dpp10 
C-term antibody 

Four human serum samples (WT1-4) were diluted in sample buffer (1/20, 1/40, 
1/80), separated on 12% polyacrylamide gel and blotted onto nitrocellulose. Also 
shown are molecular weight markers (MW) and a brain lysate control sample. Non- 
specific cross reaction of the human heavy chain IgG with the anti-rabbit IgG is 
denoted by the arrow. 
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Table 1a. SNPs in the LD region. 



SNP NAME 


BASE CHANGE 


POSITION 


BETWEEN 
SNPs 


69WTC50W 


IA/U) 


91 Q1 Q 
Ziolo 




416WTP103P 


(U/C) 


z/Uoz 


5245 


416WTC84P 






3815 


416WTC98P 


(A/Ij; 


43993 


15118 


416WTP9QP 


(U/L) 


4/U46 


1054 


41 fiWTPQ^P 




49617 


2572 


416WTPRSP 




JlD/O 


1962 




(A/G) 


55046 


3469 


69 41fTWTP^1P 


(l/C) 


55432 


387 




(1/C) 


55539 


108 




/a /r*\ 
(A/G) 


67935 


12397 


6QWTP1P 


(A/G) 


£ZOC CO 

68558 


624 




(T/C) 


78826 


10269 




(C/A) 


92525 


13700 


flow l\^O.r 


(A/G) 


100613 


8089 


flOW 1 Lo?r 


(T/C) 


103182 


2570 


4 1fiWTP87P 
fiOYY H^-o /Jr 


(G/Q 


104612 


1431 




(T/C) 


127782 


23171 




(G/T) 


129002 


1221 


fiQWTP 1 0P 


(A/G) 


129284 


283 




(T/C) 


131285 


2002 J 


416WTPQSP 


(A/1) 


137733 


6449 




(A/G) 


145407 


7675 j 


416WTPQ7P 


(A/U) 




5050 


416WTPRRP 

tIOW A\_>oCT 


(1/Cj 


173663 


23208 


6Q 41 6WTPS1P 

U7.»tlU TT lwil 


(A/G) 


175638 


1976 


543WTP71P 


(Ul) 


1913oo 


15751 


543WTPR0P 

J*TJ TT x V-^-Ov/X 


/T/P\ 


1 OAOO 1 


5604 


543WTC100P 

i «y TT X V*X WX 




zvJoo34 


11844 


543WTP40WP 

J*TJ TT X V'T/ TV X 


(A/O) 


ZlZOZO 


3993 


416WTP92W 

TIU TT X V*-.7^ TT 


KSJi) 


ZZl fui 


8936 


S43WTP109P 


U'<-) 


233484 


11724 


543WTC40P 

J^J TT X V^tvl 


(A/G) 


Z4j4Zz 


11939 


543WTC41P 




Z4.JD3Z 


111 


543WTC31P 


(A/G) 


247247 


1716 


543WTC33P 


(A/G) 


247709 


463 


543WTC34P 


(G/T) 


247827 


119 


DP1011 


(A/G) 


247839 


13 


MX201 


(A/G) 


248388 


550 


543WTC112P 


(T/A) 


249018 


631 


543WTC113P 


(A/G) 


250533 


1516 


543WTC114P 


(C/T) 


250618 


86 


543WTC115P 


(G/C) 


250846 


229 


543WTC116P 


(T/A) 


251962 


1117 



WO 02/086113 




PCT/GB02/01887 



543WTC117P 


(A/G) 


252112 


151 


543WTC90P 


(A/G) 


252117 


6 


543WTC91P 


(T/C) 


252279 


163 i 


543WTC120P 


(G/T) 


252798 


520 


DP1036 


(A/G) 


255834 


3037 


DP1037 


T deletion 


256379 


546 


DP1039 


(C/T) 


256845 


467 


DP1041 


(C7T) 


257046 


202 


543WTC121P 


(T/Q 


257959 


914 


DP1043 


(C/T) 


258094 


136 


543WTC122P 


(A/C) 


259007 


914 


543WTC38P 


(T/Q 


262047 


3041 


543WTC39P 


(G/T) 


262128 


82 


543WTC123P 


(ATT) 


262259 


132 


543WTC124P 


(A/G) 


262881 


623 


543WTC125P 


(T/A) 


265108 


2228 


543WTC24P 


(A/G) 


1 266492 


1385 


543WTC47W 


(A/G) 


266831 


340 


543WTC46WP 


(C/G) 


267295 


465 


543WTC45WP 


(T/C) 


267348 


54' | 


543WTC44WP 


(T/Q 


267449 


102 


543WTC110P 


(A/G) 


267901 


453 


DP1017 


(A/G) 


! 268034 


134 


DP1020 


(T/G) 


268075 


42 


543WTC42P 


(T/C) 


276913 


8839 


543WTC43P 


(T/G) 


277495 . 


583 


543WTC81P 


(A/G) 


282621 


5127 


543WTC101P 


(A/G) 


293696 


11076 


16WTC126P * 


T insertion 


296131 


2436 


16WTC105P ** 


A insertion 


299520 


3390 


16WTC106P 


(A/G) 


301602 


2083 


16WTC104P 


(G/T) 


305609 


4008 


317WTC61P 


(OA) 


317736 


12128 


317WTC62P 


(C/A) 


317891 


156 


317WTC59P 


(T/Q 


318524 


634 


317WTC66P 


(C/A) 


319819 


1296 


317WTC67P 


(A/G) 


320386 ! 


568 


317WTC68P 


(T/Q 


321029 


644 


317WTC55P 


(T/C) 


322250 


1222 


317WTC56P 


(A/Q 


322328 


79 




(T/C) 


323846 


1519 


317WTC108P 


(Am 


324451 


606 


317WTC109P 


(A/G) 


324654 


204 


317WTC127P* 


T insertion 


325873 


1220 


317WTC107P 


(T/C) 


326037 


165 


317WTC60P 


(T/C) 


326869 


833 


317WTC58P 


(A/G) 


329420 


2552 


317WTC64P 


(C/A) 


332092 


2673 
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317WTC65P 




oaoi 17 


ZD 


MX5301 


fPm 




1 coo 


MX5302 


Vw l) 


^^/ini7 

■3J4U1 / 


1U 


317WTC69P 


(CfT\ 


.541 14j 


"11 Of* 

7129 


317WTP70P 




541 4U3 


259 


3 1 7 WTP7dP 




o>iono 1 

3429s i 


1579 


117WTP7<\P 


/ A 


344136 


1156 


1 1 7 WTPQP 


(1/Cj 


04/028 


2893 


317WTPRP ' 




54/Uo3 


56 


317WTP71P 




351373 


4291 


317WTP79P 




354/55 


3383 


3 T7WTP7^P 


(A/Or) 


354991 


237 




{UL) 


359096 


4106 


317WTP76P 




359224 


129 


LYJLA.OOUO 


(A/G) 


359238 


15 


J 1 / W 1 U / Jr 


(A/G) 


368313 


9076 


IVIAOOUI 


(A/G) 


370001 


1689 


jl/WIL//r 


(A/G) 


375401 


5401 


j x l W 1 i^JUJr 


(A/G) 


380193 


4793 


31 /W lCz9r 


(C/A) 


385780 


5588 


-2 1 7TX7TrM /ir> 
j J / W H^i4Jr 


(A/C) [ 


389388 


3609 




(C/A) 


389960 


573 


317WTC16P 


(A/CA 

\t\IKJ) 




OJ 


317WTC17P 


(T/G) 


390153 


112 


317WTC4W 


(A/G) 


403433 


13281 


317WTC28P 


(G/C) 


407126 


3694 


317WTC79P 


(G/C) 


413075 


5950 


317WTC82P 


(C/G) 


415357 


2283 


317WTC25P 


(G/T) 


421402 


6046 











base position given before a set of T's 
base position given before a set of A's 



Note: D2S308- position 261056-261210 
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Table 1b. SNPs genotyped In the LD region. 



SNPNAME 


BASE CHANGE 


POSITION 


i DISTANCE 
BETWEEN 
SNPs 


416WTC84P 


(G/T) 






416WTC98P 


(AJG) 




i IjIIo 


69.416WTC53P 


(TJC) 


•iSd^O 


y44U 


69WTC1P 


fA/G. 




111 1*7 

1J127 


416WTC89P 




IfNIftO 


34625 


69WTC12P 




i oonno 
izyuuz 


25821 


416WTC96P 




i as Am 


16406 


543WTC21P 




1Q1QOQ 


45982 


543WTC100P 




ZU0O34 


17447 


543WTC49WP 


f A/G. 


ZlZoZO 


3993 


543WTC41P 


rr7G. 


O/K^ao 

Z4j0jZ 


32707 i 


543WTC31P 


( A/n\ 


04*70/1*7 
Z4/Z4/ 


1716 


543WTC33P 




1/1*7*7 /"Ml 

Z4 / /U9 


463 


543WTC34P 


ISjfl ) 


0/I7Q07 
Z4/0Z7 . 


119 


DP1011 




0/1*7010 


13 


543WTC112P 


U/A) 


z4yUlo 


1180 


543WTC114P 


frm 


ZDUola 


1601 


543 WTC1 1 5P 




icno/nc 


229 


543WTC90P 


/A/r;\ 

(A/U) 


Kl 1 ! 1 7 
ZjZU / 


1272 


543WTC91P 




ZDZZ/y 


163 


DP1039 




ZD Do4 j 


4567 


DP1043 




oconO/f 


1250 


543WTC122P 




o*\onn7 
zoyuu/ 


914 


543WTC38P 


five. 


zozu** / 


3041 


543WTC39P 


fG/T. 


ZOZIZo 


DO 

oz 


543WTC123P 




zozzoy 


132 


543WTC124P 




ZQZOOl 


OZ3 


543WTC125P 




Of\*i 1 OR 


zzzo 


543WTC24P 




OAM.00 


1 1OC 


543WTC46WP 




Ofi79QC 


oU4 


543WTC45WP 






j4 


543WTC44WP 




Z0/44y 


1U2 


543WTC110P 


fA/G. 




/ICQ 

4Di 


DP1020 


CT/G. 




17C 

1/5 


543WTC42P 


(T/C) 


276913 


8839 


543WTC43P 


(T/G) 


277495 


583 


543WTC81P 


(A/G) 


282621 


5127 


16WTC126P * 


T insertion 


296131 


13511 


16WTC105P ** 


A insertion ' 


299520 


3390 


16WTC106P 


<A/G) 


301602 


2083 


16WTC104P 


(G/T) 


305609 


4008 


317WTC61P 


(C/A) 


317736 


12128 


317WTC62P 


(C/A) 


317891 


156 | 


317WTC59P 


(T/C) 


318524 


634 j 
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317WTP66P 


(UA) 


319819 


1296 


3 1 7 WTP67P 

Jl / VV 1 ^O/r 


(A/U) 


320386 


568 


1 1 7 WTP6RP 
->1 / VV 1 V-OOi 


(1/C) 


321029 


644 


3 17WTP<.SP 


(1A-0 


322250 


1222 


317WTP^6P 


(A/C) 


322328 


79 


3 1 7 WTP^7P 


(T/C) 


323846 


1519 


H7WTP1fiftP 
31 / W i valuer 


(A/T) 


'iO/l /Iff t 

324451 


606 


3 1 7WTr 1 0TP * 


T insertion 


325873 


1423 


3 1 / w 1 y^uuir 


(T/C) 


326869 


997 


Dki W lLOor 


(A/G) 


329420 


2552 


3 1 / VV 1 Uo4r 


(C/A) 


332092 


2673 


31 / W luOOr 


(A/G) 


332117 


26 




(C/T) 


334008 


1892 




(C/T) 


334017 


10 


31/ YV1C09P 


(C/T) 


341145 


7129 


31/ WlC/Ur 


(C/A) 


341403 


259 


31 /WlC/4Jr 


(T/C) 


342981 


1579 


317W1C75P 


(A/G) 


344136 


1156 


317WTC9P 


(T/C) 


347028 


2803 


317WTC71P 


(T/C) 


351373 


4346 


317WTC72P 


(T/C) 


354755 


3383 


1VIX6302 


(C/T) 


359096 


4342 


317WTC76P 


(A/G) 


359224 


129 


"l 1 7WTP7P 


(A/fJ) 


305313 


rvnnn 

9090 


MX6601 


(A/G) 


370001 


1689 


317WTC30P 


(A/G) 


380193 


10193 


317WTC14P 


(A/C) 


389388 


9196 


317WTC16P 


(A/G) 


390042 


655 


3 17WTC28P 


(G/C) 


407126 


17085 


317WTC82P 


(C/G) 


415357 


8232 











* base position given before a set of T's 
** base position given before a set of A's 

Note: D2S308 - position 261056-261210 
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Table 3 

Associations between Asthma and the LnlgE and SNPs 

The A island of LD (Figure 22a,b) Is between SNPs 416WTC84P and DP1039, the B island Is 
between DP1039 and 317WTC59P , the Bii island is between SNPs 317WTC59P and 317WTC74P 
and the C island is between SNPs 317WTC74P and 31 7WTC28P. 



P(TDT) P(QTDT) 
MARKER POSITION Asthma LnlgE 



P(TDT) PCQTDT} 
MARKER POSITION - Asthma' LnlgE 



416WTC84P 

416WTC98P 

416WTC53P 

69WTC36P 

69WTC1P 

416WTC89P 

69WTC12P 

416WTC96P 

543WTC21P 

543WTC100P 

543WTC49WP 

543WTC41P 

543WTC31P 

543WTC33P 

543WTC34P 

DP1011 



30876; ; 




543WTC112P 

543WTC114P 

543WTC115P 

543WTC90P 

543WTC91P 




0:02 



0.004 
0.003 
0.03 

0.01 
0.01 
0.01 

0.02 

0.02 

0.008 

0.02 

0.003 



DP1039 

DP1043 

543WTC122P 

D2S308 

543WTC38P 

543WTC39P 

543^1^123? 

543WTC124P 

543WTC125P 

543WTC24P 

543WTC46WP 

543WTC45WP 

543WTC44WP 



L - f . Jr.'--;- 

tot* 



543WT C110P 

DP1020 

543WTC42P 

543WC43P 

543WTC81P 

16WTC126P 
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16WTC106P 
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317WTC62P 

317WTC59P 



0.0003 
0.00002 



0.003 



f^58073' <fj 
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305609 ; y 

:317735.S.'^ 
, ; 3V78?*^f;^ 
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0.03 
6.02 

.0.02 • 

0.01 
0.009, 



0.01 



:. 31,7WTC6feP. ; 

: 317WTC68P.' 
317AVTC55P 
317%C56P 
31WTG57H , 
V31 J WVTC108P li < 
£ 31^VTC127P ' 

>i.7WX358P • 
»: 31? V WTC64P, 

317W7C65P 
, $1X5301 

MX5302 

317WTG69P 

317WTC70F: i r! 




317WTC82P 



319819 

320386 

321029 

322250 

322328 

323846 

324451 

325873 

326869 

329420 

332092 

332117 

334008 

334017 

341145 

341403 

342981 

344136 

347028 

351373 

354755 

359096 

359224 

368313 

37001 

380193 

389388 

390042 

407126 

415357 
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Table 5. Regions of sequence conserved between human and mouse. Coordinates are 
given with reference to Figure 1. 



Region Id Start Finish base Length % Identity 
base human to 

mouse 


MouseHomol 1 


245976 


246018 


43 


88% 


MouseHomol 2 


248284 


248554 


271 


84% 


EXON1A 


248553 


248776 


224 




MouseHomol 3 


248650 


248791 


142 


90% 


MouseHomol 4 


249808 


249849 


42 


92% 


MouseHomol 5 


250353 


250397 


45 


84% 


MouseHomol 6 


250588 


250724 


137 


87% 


MouseHomol 7 


250807 


251005 


199 


92% 


MouseHomol 8 


251058 


251074 


17 


100% 


MouseHomol 9 


253968 


254001 


34 


88% 


MouseHomol 10 


^54267 


254313 


47 


85% 


MouseHomol 1 1 


255625 


255884 


260 


85% 


MouseHomol 12 


256869 


257017 


149 


85% 


SNP543WTC1 22P 


259007 


259007 


1 




D2s308 


261056 


261315 


260 




MouseHomol 13 


264408 


264477 


70 


94% 


MouseHomol 14 


266668 


267138 


471 


87% 


EXON1B 


267673 


268090 


418 




EXON1C 


269345 


269376 


32 
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Table 6. PSQ assay oligonucleotides and PGR annealing temperatures 
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Table 7. Standard deviation in DPP10 expression levels ascertained by Taqman 
analysis of blood RNA from asthmatics and Controls 



Sample Phenotype 


Standard Deviation (threshold cvcleV* 


Asthmatic 


3.47 


Control 


2.05 



* Quantitative PCR analysis was performed in triplicate on 11 asthmatic 
individuals and 9 control individuals, and the mean threshold cycle determined. 
The standard deviation (in threshold cycles) was calculated using the means of 
each individual in the relevant sample set. 



THIS PAGE BLANK (uspto) 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 
^BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

-I^LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



THIS PAGE BLANK (uspto) 



